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NOVEL MOLECULES 



FIELD OF THE INVENTION 

5 The present invendon nJated generally to novel molecules and more particularly novel 
proteinaceous molecules involved in or associated with regulation of ceU activities and/or 
viability. The present invention is particularly directed to novel serine proteinases and anovel 
kinase and to derivatives, agonists and antagoiusts thereof. In one embodiment, the present 
inventionprovidesanovelserineproteinase. referred to herein as -HELA2" or "^^^^^^ 

10 has roles in spermatogenesis, in suppressing testicular cancer and as a marker for cancers. 

BACKGROUND OF THE INVENTION 

The rapidly increasing sophistication of recombmant DNA technology is greaUy facilitating 
15 ^handde>«k,pmentinthemedicalandamedhealti»fields.TO^ 

the area of cell regulation leading to a greater understanding of tiie events leading to or involved 

tocancer. development ofacquiredimmunodefxciency disease syndroms 

disorders, heart disease, tissue graft rejection and infertility amongst many otiier conditions. 

20 Twoparticularlyimportantclassesofmoleculesaredieproteinasesandkinase^^ 

Proteinases play important roles in a number of physiological and patirological processes such 
as proteolytic cascades invoh«d in blood coagulation, fibrinolysis and complement activation as 
well as cleavage of growtii factors, hormones and receptors, the release of bioactive molecules 
25 and processes involving cell proliferation and development, inflammation, tiimour growti. and 
n^tastasis. Of particular significance are tiie cellular proteinases, or tiiose protemases 
syntiiesized in ceUs and tissues which serve to activate or deactivate proteins responsible for 
performing specific functions. Tliese proteina^ may be found outside the cell, within ti« cell 
or may be present on the cell surface. 

Serine proteinases are particularly important. These proteinases are characterrsed by a 
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n^chanism involving serine, hisddine and aspartate amino acids in the serine proteinase active 
site Members of the serine proteinase family which play important roles in a range of cellular 
functions and which have demonstrated causative roles in human diseases include tissue-type 
plasminogen activator and thrombin (thrombosis and blood clotting), urokinase-type plasminogen 
5 activator (cancer and metastasis), trypsin and elastase (emphysema and Uver disease) and 
angiotensin converting enzyme (hypertension). 

A serine proteinase is also implicated in TNFa degradation and soluble TNF-receptor (p75) 
release by THPl ceUs (Vey et al Eur. J. /mm. 26, 2404-2409, 1996). Serine proteinases have 
10 been implicated in the acth^tion of macrophages (Nakabo et al J. Uukocyte Biol 60, 328-336. 
1996) in nuclear laminin degradation in apoptosis (McConkey .fa/. J.Biol Chem.. 271, 22398- 
22406 1996), in prostaglandin-E2 induced release of soluble TNF receptor shedding (Choi et 
al. CelMar Immunology 170. 178-184. 1996), in PAF synthesis (BussoUno et al Eur. J. 
ImnumoL 24. 3131-3139. 1994). and in the proteolysis of DcB, a regulatory molecule important 
15 insignaltnB«ductionandapoptosis.Rek«seofserineprote^ 

to CTL killing and many of the substrates cleaved by granzymes are also cleaved by ceUular 
p^teinases (for example. II. 1 P is a substrate for Granzy me B as weU as d« cysteme proteinase, 
interleukin 1 P-converting enzyme GCE)). Granzyme A. a serine proteinase with Arg-amidolytrc 
activity, has been reported to induce the production of IL-6 and IL-8 in lung fibroblasts (Sower 
20 etal. CelMar immunology 171. 159-163. 1996) and cleaves IL-iP to a 17kD mature form that 

is biologically active. 

Kinases are a large group of molecules, many of which regulate the response of cells to external 
stimuli These molecules regmate proliferation and differentiation in eukaryotic cells ftequentiy 
25 via signal transduction pathways. 

The identification of new serine proteinases and kinases permits the development of a range of 
derivatives, agonists and antagonists at the nucleic acid and protein levels which in turn have 
applfcations in the tieatinent and diagnosis of aiange of conditions such as cancer, inflammation. 
30 neurological disorders amongst many other conditions including conditions which initiate or 
promote apoptosis such as viralinfix:tion.okl age and dmg abuse. One particul^^^ 
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proteinase HELA2 (testisin) identified in accordance with the present invention is involved in 
spermatogenesis, testicular cancer and as a marker for cancer. 

SUMMARY OF THE INVENTION 

5 

Sequence Identity Numbers (SEQ ID NOs.) for the nucleotide and amino arid sequences referred 
to in the specification are defined at die end of the subject specification. 

Throughout this specification and the claims which foUow. unless the context requires otherwise, 
10 the word "comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated integer or group of integers but not the exclusion of any other 
integer or group of integers. 

One aspect of the present invention provides a novel molecule in isolated form involved in or 
15 associated with regulation of cell activity and/or viability. 

Another aspect of the present invention contemplates an isolated proteinaceous molecule 
involved in or associated with regulation of cell activity and/or viabUity comprising a sequence 
of amino acids encoded by a nucleotide sequence, at least a portion of which, is capable of being 
20 ampUfied by polymerase chain reaction (PGR.) using the following primers: 

5' ACAGAATrCTGGGTIGTIACIGaGaCAYTGS' [SEQ ID NOrl]; and 
5' ACAGAATTCAXIGGICaCCiaGT/AXTaCCS' [SEQIDN0:21; 

25 

or a complementary form of said prinners. 

The proteinaceous molecule of tiie present invention may be a serine proteinase or a kinase. 

30 Yet anotiier aspect ofthe present invention is directed to an isolated serine ^^^^ 

the amino acid sequence substantially set forth in SEQ ID N0:4 or an amino acid sequence 
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having at least about 50% similarity to all or part thereof. This serine proteinase is refened to 
herein as a short isofonn (S) of "HEU^" or "testisin". 

Still another aspect of the present invention relates to an isolated serine protemase comprising 
5 the amino acid sequence substantially as set forth in SEQIDN0:6 or an amino acid sequence 

having at least about 50% similarity to all or part thereof. TTus serine proteinase is referred to 
herein as a long isoform (L) of HELA2 (testisin). 

Still yet another aspect of the present invention provides an isolated serine proteinase comprising 
10 an amino acid sequence substantially as set forth in SEQ ID N0:8 or an ammo acid sequence 
having at least about 50% similarity to all or part thereof. This serine proteinase is referred to 
herein as "ATC2". 

Evenyet another aspect ofthepresent invention isdiiectedtoa^rinep^^^^ 

15 compri^pgasequenceofaminoacidsencodedbyanucleot^ 

in SEQ IDNaSoranucleotidesequencehaving at least 50% similarity to all or part thereof or 

anucleotide sequence capable of hybridising to the sequence set forthin SEQ ID N0:3und. 
low stringency conditions at 42**C. 

20 Anoth^aspectofthepresentinventionrelatestoaserineprotein^ 

a sequence of amino acids encoded by a nucleotide sequence substantially as set forth m SEQ 
ID NO-5 or a nucleotide sequence having at least 50% sunihrity to all or part thereof or a 
nucleotide sequencecapableofhybridising to the sequence set forth in SEQmNO^ 

Stringency conditions at 42°C. 

Still anotixer aspect of ti.e present invention provides a serine proteinase in isolated form 
comprisingasequenceof amino acids encodedbyanucleotide sequence substantially as setfo^ 

inSEQIDNO:? or a nucleotide sequence having at least 50% similarity to all or part thereof or 
anucleotide sequence capable of hybridising to die sequence set forth in SEQmN0:7^^^^ 

30 low Stringency conditions at 42"C. 
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Another embodiment of the present invention is directed to a kinase in isolated fonn comprising 
an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 50% amino acid 
similarity to all or part thereof. This kinase is referred to herein as "BC0N3". 

5 In a related embodiment, the kinase comprises an amino acid sequence encoded by a nucleotide 
sequence substantially as set forth in SEQ ID N0:9 or a nucleotide sequence having at least 50% 
simUarity to all or part of the nucleotide sequence set forth in SEQ ID NO:9 or a nucleotide 
sequence capable of hybridising to tiie nucleotide sequence set forth in SEQ ID N0:9 under low 
stringency conditions at 42°C. 

10 

The present invention fiirther provides an isolated nucleic acid molecule encoding a polypeptide 
wherein at least a portion of said nucleic acid molecule is capable of being amplified by 
polymerase chain reaction (PC31) using the following primers: 



15 



5' ACAGAATTCTOGGTIGTIAaGaGaCAYrG3' [SEQ ID NO:l]; and 
S'ACAGAATTCAXIGGICacaaGT/AXTaCCS' [SEQ ID NO:2]; 
or a complementary form of said primers. 

20 

The present invention also provides an isolated nucleic acid molecule comprising a sequence of 
nucleotides substantially as set forth in SEQ ID N0:3 or having 50% similarity to aU or part 
tiiereof or a nucleic acW molecule capable of hybridising to SEQ ID N0:3 under low stringency 
conditions at ATC. 

25 

AnotiKJT aspect of tiie present invention provides an isolated nucleic acid molecule comprising 
a sequence of nucteotides substantiaUy as set forth in SEQ ID N0:5 or having 50% similarity to 
all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID N0:5 under low 
stringency conditions at 42°C. 

30 

Still another aspect of the present invention is directed to an isolated nucleic acid molecule 
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comprising a sequence of nucleotides substantially as set forth in SEQ ID N0:7 or having 50% 
similarity to all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID N0:7 
under low stringency conditions at 42'C. 

5 Even stiU another aspect of the present invention provides an isolated nucleic acid molecule 
comprising a sequence of nucleotides substantially as set forth in SEQ ID N0:9 or having 50% 
similarity to aU or part tiiereof or a nucleic acid molecule capable of hybridising to SEQ ID NO:9 
under low stringency conditions at 42°C. 

10 Anotijer aspect of the present invention provides an isolated serine proteinase encoded by a gene 
proximal to a cluster of genes on a manunalian chromosome. 

More particularly, tins aspect of tiie present invention is directed to a serine proteinase encoded 
by a gene proxmial to a cluster of genes or human chromosome 16pl3.3 or its equivalent in a 
15 non-human species. 

Still more particulariy. die serine proteinase is encoded by a gene comprising a nucleotide 
sequence substantially as set forth in SEQ ID N0:3 or 5 or 28 or 29 or 30 or a nucleotide 
sequence having at least 50% simihirity to any one tiiereof or a nucleotide sequence capable of 
20 hybridizing to any one of SEQ ID NO:3 or 5 or 28 or 29 or 30 under low stringency conditions 
at 42'C or a nucleotide sequence encoding a serine proteinase having an amino acid sequence 
substantially as set forth in SEQ ID N0:4 or 6 or an amino acid sequence having at least about 
50% similarity to SEQ ID NO:4 or 6. 



25 
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BRDEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a representation showing (A) schematic and (B) hydiophobicity plot of the HELA2 
amino acid sequence. 

5 

Figure 2 is a diagrammatic representation showing: (A) tiie amino acid sequence of HELA2 
(testisin). The putative signal sequence, light chain, heavy chain and transmembrane domains are 
as indicated, tbt catalytic amino acids. His. Asp and Ser are as designated; insertion of Tyr-Ser 
(YS) 4 amino acids after the catalytic His is found in tiie long isoform of testisin and is due to 
10 alternative mRNA splicing; (B) in vitro transcriptionAranslation of HELA2 (testisin) showing 
the protein product. 

Figure 3 is a diagrammatic representation of plasmid constructs pBluescriptHELA2(S) and 
pBluescriptHELA2(L) containing full length cDNAs for testisin (short isoform (S)) and testisin 
15 (long isoform (L)), respectively. 

Figure 4 is a diagrammatic representation of plasmid constructs pQET(20-295)N and pQET(20- 
295)C. wherein the hydrophobic residues of testisin were removed and tiie remaining sequences 
cloned into pQE prokaryotic expression plasmids; plasmids pGEX-1 (90-279) comprising a 
20 carboxy terminal part of testisin fused to glutatiiione-S-transferase. 

Figure 5 is a photographic representation of: (A) sUver stained gel showing purification of 
recombinant HELA2 (testisin) from E. coli. The purified HELA2 (testisin) is indicated by tiie 
arrow in tiie ehiate fractions. Some HELA2 (testisin) is also found in the wash fractions as tiie 

25 affinitymatrixwasnotusedmexcess. His-N21 is one clone containing tiieamino-terminal His 
tag. and clones His-C21, His-C22 and His-C23 are tiiree different clones witii tiie carboxy- 
terminal His tag. (B) Western blot of native and denatured recombinant HELA2 (testisin) 
probed widi Clontech anti-Ms tag-antibody. The 321d> band shown by tiie arrow is HELA2 
(testisin). HELA2 (testisin) is not detected in tiie denatured samples as it appears tiiat 

30 denaturation witii urea destroys tiie His epitope recognised by tiie monoclonal antibody. 
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Figure 6 is a representation of the amino acid sequence of HELA2 (testisin) showing the regions 
of the molecule selected for generation of peptide antigens. 

Figure 7 is a photographic representation of a Western blot of GST-HELA2 (testisin) fusion 
5 protein purified by affinity chromatography. 

Figure 8 is a diagrammatic representation of eukaryotic expression constructs, pcDNA3-Test(S- 
C), pcDNA3-Test(L-C) and pcDNA3-Test(l-297)L-C. 

10 Figure 9 is a diagrammatic representation showing a histogram of the signal intensity from a 
Clontech Master RNA blot of the tissue distribution of HELA2 (testisin) in RNA from 50 
different normal tissues. (A) Probed with HELA2 (testisin) specific piobe; (B) Probed with 
BCX)N3 specific prove which is ubiquitously expressed. The 8 tissues on the right hand side of 
tiie diagram arc the control (negative) samples. 

15 

Figure 10 is aphotographic representation of a multiple normal tissue Nortiiem blot (Clonetech) 
probed witii: (A) HELA2 (testisin) specific probe and (b) BCX)N3 specific probe. 

Figure 11 is a photographic representation of agarose gel of PGR products generated by 
20 amplification of HELA2 (testisin) cDNA in prevasectomised and post-vasectomised ejaculate 
specimens. The HELA2 (testisin) PGR product is 464bp and the p2-macroglobulin product is 
250 bp. 

Figure 12 is a photographic representation of in situ hybridization of rat testis showing tht 
25 localisation of HELA2 (testisin) mRNA to tiie germ cells of tiie testis. 

Cigpre 13 is a representation showing: (A) spread of normal metaphase chromosomes showing 
bright dots where HELA2 (testisin) is expressed at 16pl3.3; (B) Diagrammatic representation 
of chromosome 16pl3.3 showing location of HELA (testisin) and relationship to other disease 
30 causing genes. 
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Rgure 14 is: (A) a photographic representation of northern blot analysis of HELA2 (testisin) 
mRNA showing signals in normal testis of 4 patients and absence of signal in flie tumours of 
these patients; (B) aphotographic representation of tiie localisation of HELA2 (testism) protein 
in a human germ cell tumour section assessed by immunohistochemical staining using anti- 
5 HELA2 (testisin) peptide antibodies. Staining is only detected in tiie normal (N) tissue and not 
present in the tumour (T) tissue. 

Figure 15 is a diagrammatic representation of the genomic map of HELA2 (testisin) showing 
experimentally determined intron/exon boundaries and relative sizes of the intix>ns (marked witii 
10 a letter) and exons (marked witii a roman numeral). 

Figure 16 is a representation of HELA2 (testisin). Nucleotides in introns are in lowercase and 
exons in uppercase. The putative transcription start site is marked by +1. 

15 Figure 17 is a representation of tiie DNA sequence of Intron C and flanking exons showing 
where alternative mRNA splicing occurs to generate tiie two isoforms of HELA2 (testisin). 

Figure 18 is a representation of: (A) tiie cDNA sequence of the mouse homologue of HELA2 
(testisin). Catalytic residues are indicated by circles and cysteines likely involved in disulfide 
20 bonding are indicated by squares; (B) Hydrophobicity plot of HELA2 (testisin) amino add 
sequence. 

Figure 19 is a diagrammatic representation of chromosome 16pl3.3 showing tiie serine 
proteinase gene cluster which includes HELA2 (testisin). Unes represent cosmids containing 
25 tiie respective serine proteinase genes. 

Figure 20A is a representation of: (A) tiie cDNA sequence of SPOOILA (SEQ ID NO:28). 
Catalytic residues are indicated by circles and cysteins likely involved in disulfide bonding are 
indicated by squares; (B) hydrophobicity plots of SPOOILA amino acid sequence. 

30 

Figure 20B is a representation of: (A) tiie cDNA sequence of SP002LA (SEQ ID NO:29). 
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Catalytic residues arc indicated by circles and cysteines likely involved in disulfide bonding are 
indicated by squares. (B) Hydrophobicity plot of SP002LA amino acid sequence. 

Figure 20C is a representation of: (A) the cDNA sequence of SP003LA (SEQ ID NO:30). 
5 Catalytic residues arc indicated by circles and cysteines likely involved in disulfide bonding are 
indicated by squares. (B) Hydrophobicity plot of SP003LA amino acid sequence. 

Figure 21 is a photographic representation of in vitro transcriptionAranslation of BC0N3 
showing the protein products. 
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A summary of the SEQ ID NOs used throughout the specification is presented in Table 1. 

TABLE 1 



15 



20 



SEP IP NO DESCRimON 



1 * PGR primer sequence 

2 * PGR primer sequence 

3 Nucleotide sequence of short form of HELA2 

4 Amino acid sequence of short form of HELA2 
jQ 5 Nucleotide sequence of long form of HELA2 

6 Amino acid sequence of long form of HELA2 

7 Nucleotide acid sequence of ATC2 
g Amino acid sequence of ATC2 

9 Nucleotide acid sequence of BCOM3 

10 Amino acid sequence of BCOM3 

1 1 Primers used to generate amino terminal 
tagged protein 

12 Primers used to generate amino terminal 
tagged protein 

13 Primers used to generated carboxy-linked 
temunal protein 

14 Primers used to generated carboxy-linked 
tenninal protein 

15 Peptide antigen T20-33 

16 Peptide antigen T46-63 

17 Peptide antigen T175-190 

18 Forward primer 

19 Reverse primer 
25 20 Forward primer 

21 Reverse primer 
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TABLE 1 (Continued) 



5 



10 



22 


Forward primer 


23 


Reverse {vimer 


24 


Serine proteinase activation motif 


25&26 


Mouse HELA2 cDNA sequence 


27 


Human genomic DNA sequence 


28 


Clustered serine proteinase gene SPOOILA 


29 


Clustered serine proteinase gene SP002LA 


30 


Clustered serine proteinase gene SP003LA 



* Abbreviations: 

15 



X= AorG 
Y=CorT 
1= Inosine. 
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A list 



of single and three letter abbreviations for amino acid residues is presented in Table 2. 



TABLE 2 



5 Amino Acid 


Three-letter 


One-letter 




Abbreviation 


Symbol 


Alanine 


Ala 


A 


Arginine 


Arg 


R 


10 Asparagine 


Asn 


N 


Aspartic acid 


Asp 


D 


Cysteine 


Cys 


C 


Glutamine 


Gin 


Q 


Glatamic add 


Glu 


E 


IS Glycine 


Gly 


G 


Histidine 


His 


H 


Isoleucine 


He 


I 


Leucine 


Leu 


L 


Lysine 


Lys 


K 


20 Metiiionine 


Met 


M 


Phenylalanine 


Phe 


F 


Proline 


Pro 


P 


Serine 


Ser ■ 


S 


Threonine 


Thr 


T 


25 Tryptophan 


Trp 


W 


Tyro»ne 


Tyr 


Y 


Valine 


Val 


V 


Any residue 


Xaa 


X 



30 
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DETTAILED DESCMPTION OF PREFERRED EMBODIMENTS 

The present invention is piedicated in part on a genetic engineering approach to identify 
mK:leotidesequencesencodingserineproteinasesorkinases. The genetic engineering approach 
5 is based on the use of degenerate primers corresponding to conserved regions of serine 
proteinases (amino acids flanldng His- and Ser- residues) to amplify gene fragments spanning 
these regions for cDNA. using low stringency reverse transcriptase-polymerase chain reaction 
(RT-PCR). 

10 This technique has been successfully used, in accordance with die present invention, to identify 
serine proteinases and kinases useful in modulating cell activity and viabiUty including modulating 
spermatogenesis, acting as tumour suppressors and acting as a marker for non-testicular cancers. 

Accordingly, one aspect of tiie present invention provides a novel molecule in isolated form 
15 involved in or associated witii regulation of cell activity and/or viability. 

More partfcularly. tiie present invention contemplates a novel serine proteinase m isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence, at least a portion of 
which, is capable of being amplified by polymerase chain reaction (PGR) using tiie following 
20 {Himers: 

5' ACAGAATrcTGCX5TIGTIAaGaGCICAYTG3' [SEQ ID NO:ll; and 
5'ACAGAATrCAXIGGICacaaGT/AXTaCC3' [SEQ ID N0:21; 

25 

or a complementary form of said primers. 

Preferably, X is A or G. Y is C or T and I is inosine. 

30 Inaparticularly preferred embodiment, the isolated serine proteinase comprises the amino acid 
sequence substantially set forth in SEQ ID NO:4 or an amino add sequence having at least about 
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50% sinularity to all or part thereof. This serine proteinase is referred to herein as a short 
isoform of ■•HELA2" or "HELA2 (testisin)". The terms "HH^" and "testisin" are used 
uiterchangedly throughout the subject specification to refer to the same molecule. 

5 In another preferred embodiment, the amino acid sequence of the serine proteinase is 
substantially as set forth in SEQ ID NO:6 or an amino acid sequence having at least about 50% 
similarity to aU or part thereof This serine proteinase is the long isoform of HELA2 or HELA2 
(testisin). 

10 Yet another preferred embodiment of the present invention provides an amino acid sequence 
substantially as set forth in SEQ ID N0:8 or an amino acid sequence having at least about 50% 
simUarity to aU or part thereof. This serine proteinase is referred to herem as "ATCr. 

Another aspect of the present invention relates to a serine proteinase in isolated form comprising 
15 a sequence of amino acids encoded by a nucleotide sequence substantiaUy as set forth in SEQ 
ID N0:3 or a nucleotide sequence having at least 50% similarity to all or part thereof or a 
nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID NO:3 under low 
stringency conations at 42°C. 

20 Still another aspect of the present mvention is directed to a serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence substantially as set forth 
in SEQ ID N0:5 or a nucleotide sequence having at least 50% simUarity to all or part thereof or 
a nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID NO:5 under 
low stringency conditions at 42°C. 

25 

In another aspect of the present invention, there is provided a serine proteinase in isolated form 
comprising a sequence of amino acids encoded by a nucleotide sequence substantiaUy as set forth 
in SEQ ID NO:7 or a nucleotide sequence having at least 50% simUarity to all or part thereof or 
a nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID N0:7 under 
30 low stringency conditions at 42"C. 
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Another embodiment of the present invention is directed to a kinase in isolated form comprising 
an amino acid sequence substantiaUy as set forth in SEQ ID NO: 10 or having 50% amino acid 
similarity to all or part thereof. This kinase is leferred to herein as "BC0N3". 



5 inarelatedenil»diment.thekinasecomprisesanaminoacidsequenceen^ 

sequence substantially as set forth in SEQ ID NO:9 or a nucleotide sequence having at least 50% 
similarity to all or part of the nucleotide sequence set forth in SEQ ID N0:9 or a nucleotide 
sequence capable of hybridising to the nucfeotide sequence set forth in SEQ ID N0:9 under low 
stringency conditions at 42°C. 



10 



The present invention fiirther provides an isolated nucleic acid molecule comprising a sequence 
of nucleotides encoding or complementary to a sequence encoding a novel molecule involved 
in or associated with regulation of cell activity and/or viability. Preferably, the nucleic add 
molecule is capable of bemg amplified by PGR using the primers set forth in SEQ ID N0:1 
15 and/or SEQ ID NO:2. 

More particukiriy. the present invention further provides an isolated nucleic acid molecule 
comprising a sequence of nucleotides substantially as set forth in SEQ ID NO:3 or having 50% 
similarity to all or part thereof or a nuctek: acki mokcute capable of hybridising to SEQ ID N0:3 
20 under low stringency conditions at 42°C. 

Another aspect of the present invention provides an isolated nucleic acid molecule comprising 
a sequence of nucleotides substantiaUy as set forth in SEQ ID N0:5 or having 50% similarity to 
all or part thereof or a nucleic acid molecule capable of hybridising to SEQ ID N0:5 under low 
25 stringency conditions at 42**C. 

Another aspect of the present invention is directed to an isolated nucleic acid molecule 
comprising a sequence of nucleotides substantially as set forth in SEQ ID N0:7 or having 50% 
similarity to all or part thereof or a nuclei ackl molecule capable of hybridising to SEQ ID N0:7 
30 undra: low stringency conditions at 42°C 
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Still another aspect of the present invention pn>vides an isolated nucleic acid molecule 
conpisingasequence of nucleotides substantiaUy as set forth in SEQmN0:9 or hav^^^ 

«„rilarityto anor part thereof oranucleic acid nK,lecukcap*^^ 
under low stringency conditions at AVC. 

' Reference herein to alow stringency includes low stringency at42»C includes and encompasses 
from at least about 1% v/v to at least about 15% v/v formamide and from at least about IM to 
at least about 2M salt for hybridisation, and at least about IM to at least about 2M salt for 
washing conditions. Alternative stringency conditions may be appUed where necessary, such as 

10 medium stringency, which includes and encompasses from at least about 16% v/v to at least 
about 30% v/v formamide and from at least about 0.5M to at least about 0.9M salt for 
hybridisation, and at least about 0.5M to at least about 0.9M salt for washing conditions, or high 
stringency, whichincludes and encompasses from at least about 31% v/v to at least about 50% 
v^ formamide and from at least about O.OIM to at least about 0. 15M salt for hybridisation, and 

15 at least about O.OIM to at least about 0.15M salt for washing conditions. 

Reference herein to simihttity to "part" of a sequence means simUarity to at least about 4 
contiguous amino acids or at least about 12 contiguous nucleotide bases and more preferably at 
least about 7 contiguous amino acids or at least about 21 contiguous nucleotide bases. 



20 



The term "similarity" includes exact identity between sequences or, where the sequence differs, 
different amino acids may be related to each otiier at the structural, fiinctional, biochemical 
and/or conformational levels. 

25 The term "isolated" includes biological purification and biological separation and encompasses 
molecules having undergone at least one purification, concentration or separation step relative 
to its natural environment. For example, a preparation may comprise at least about 10%, 
preferably at least about 20%, more preferably at least about 30%. stiU more preferably at least 
about 50% or greater of tiie molecule relative to at least one otiier component in a composition 

30 as detemiined by activity, mass, amino acid content, nucleotide content or otiier convenient 
means. 
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Heicinafter. the inolecules of the present invention are lefened to as a "proteinase/kinase". The 
term "proteinase/kinase" includes the serine proteinases HELA2 (testisin) and ATC2 and the 
kinase BC0N3. The proteinase/kinase of the present invention may be in isolated, naturally 
occurring form or recombinant or synthetic form or chemical analogues thereof. 

5 

The proteinasetoiase of the present invention is preferably of human origin but from non-human 
origins are also encompassed by the present invention. Non-human animals contemplated by tiie 
present invention include primates. livestock animals (e.g. sheep, cows. pigs, goats, horses, 
donkeys), laboratory test animals (e.g. mice. rats, guinea pigs, hamsters, rabbits), domestic 
10 companion animals (e.g. dogs. cats), birds (e.g. chickens, geese, ducks and otiier poultry birds, 
game birds, emus. ostrKhes) and captive wiW or tamed animals (e.g. foxes, kangaroos, dingoes). 
The present invention also encompasses a proteinase/kinase homologue from Xenopus and 
plants. 

1 5 Tte nucleic acW molecules encoding a proteinase/kinase may be genomic DNA. cDNA or RNA 
such as mRNA. 

Yet anotiier aspect of tiie present invention provides an isolated serine proteinase encoded by a 
gene proximal to a cluster of genes on a mammalian chromosome. The cluster of genes is 

20 preferablyonhumanchronK>somBl6pl3.3oritsequivalentinanon-humanspecies. Thecluster 
is made up of genes all encoding or having the potential to encode a serine proteinase or 
homologue. derivative or functional or evolutionary equivalent tiiereof. Preferably, tiie gene 
cluster comprises two or more of genes comprising a nucleotide sequence selected from SEQ 
ID N0:3 and 5 (HELA2. short and long forms, respectively) and SEQ ID NO:28 (SPOOILA). 

25 SEQ ID NO:29 (SP002LA). SEQ ID NO:30 (SP003LA) and SP004LA (see Figure 19) or a 
nucleotide sequence havmg at least 50% similarity to any one of tiiose sequences or capable of 
hybridizing to any one of tiiose sequences under low stringency conditions at 4TC. 

The term "prDximal" is used in its broadest sense to mean a gpne cluster and includes a gene 
30 within proximity to another gene. 
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Another aspect of the present invention contemplates a method for cloning a nucleotide sequence 
encoding a novel serine proteinase, said method comprising screening a nucleic acid library with 
said one or more or oligonucleotides defined by SEQ ID N0:1 and/or SEQ ID N0:2 and 
obtaining a clone therefix)m which encodes said novel serine proteinase or part thereof. 

5 

Preferably, the nucleic acid library is genomic DNA. cDNA, genomic or mRNA Ubrary. 
Preferably, the nucleic acid library is a cDNA expression Ubrary. 

10 Preferably, the nucleic acid library is of human origin such as from brain, liver, kidney, neo-natal 
tissue, embryonic tissue, tumour or cancer tissue. 

With respect to HELA2 (HELA2 (testisin)), significant expression is generally only found in 
normal testis. Accordingly, die present invention extends to nucleic acid molecules capable of 
15 tissue-specific or substantially tissue-specific expression. 

Still another ernbodiment contemplates die promoter or a functional part tiiereof of die genomic 
gene encoding die subject proteinasefldnase of die present invention. The promoter may readUy 
be obtained by, for example, "chromosome walking". A particularly useful promoter is from 
20 HELA2 (testisin) which can be regarded as a testis specific promoter. This promoter can be 
used, for example, to direct testis specific expression of genetic sequences operably linked to die 
promoter and may be used inter alia gene dierapy or modulation of fertility. 

The present invention furdier contemplates a range of derivatives of die subject 
25 proteinase/kinase. Derivatives include fragments, parts, portions, mutants, homologues and 
analogues of die subject polypeptides and corresponding genetic sequences. Derivatives also 
include single or multiple amino acid substitutions, deletions and/or additions to die subject 
nwlecules or single or multiple nucksotide substitutions, deletions and/or additions to die genetic 
sequence encoding die molecuks. "Additions" to amino acid sequences or nucleotide sequences 
30 include tons widiodier peptides, potypeptides or proteins or fiisions to nucleotide sequences. 
Reference herein to tt» serine proteinase and kinase includes reference to aU derivatives diereof 
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including fiinctional derivatives or immunologicaUy interactive derivatives. 

Analogues of tiie subject serine proteinase and kinase contemplated herein include, but are not 
limited to. modification to side chains, incorporating of unnatural amino acids and/or flieir 
5 derivatives during peptide, polypeptide or protein syntiiesis and the use of crossUnkers and other 
methods which impose conformational constraints on ttie proteinaceous molecule or their 
analogues. 

Examples of side chain modifications contemplated by the present invention include 
10 modifications of amino groups such as by reductive alkylation by reaction with an aldehyde 
followed by reduction mih NbBH^; amidination with metiiylacetimidate; acylation with acetic 
anhydride; cari)amoyh»tion of amino groups with cyanatc; trinitrobenzylation of amino groups 
with 2. 4, 6-trinitiX)benzene sulphonic acid (TNBS); acylation of amino groups with succinic 
anhydride and tetrahydrophtiialic anhydride; and pyridoxylation of lysine witii pyridoxal-5- 
15 phosphate followed by reduction witii NaBH4. 

The guanidine group of arginine residues may be modified by the formation of heterocycUc 
condensation products witii reagents such as 2,3-butanedione, phenylglyoxal and glyoxal. 

20 The carboxyl group may be modified by caibodumide activation via O-acylisourea formation 
followed by subsequent derivitisation. for example, to aconesponding amide. 

Sulphydryl groups may be modified by methods such as catboxymetiiylation witii iodoacetic acid 
oriodoacetamide; performic acid oxidation to cysteic acid; formation of a mixed disulphides 
25 with other tiiiol compounds; reaction witii maleimide. maleic anhydride or other substituted 
maleimide; formation of mercurial derivatives using 4-chloromercuribenzoate. 4- 
chtoromeicuriphenylsulphonic ackl, phenylmercury chloride. 2-chloromercuri-4-nitrophenol and 
otiiw mercurials; carbamoylation witii cyanate at alkaline pH. 

30 Tryptophan residues may be modified by, for example, oxidation witii N-bromosuccimmide or 
alkylation of tiie indole ring witii 2-hydroxy-5-nitroben2yl bromide or sulphenyl halides. 
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Tyrosine residues on the other hand, may be altered by nitration with tetranitromethane to fonn 
a 3-nitrotyrosine derivative. 

Modification of the imidazole ring of a histidine residue may be accomplished by alkylation with 
5 iodoacetic acid derivatives or N-caibethoxylation with diethylpyrocarbonate. 

Examples of incorporating unnattiral amino acids and derivatives during peptide synthesis 
include, but are not limited to. use of norleucine. 4-amino butyric acid. 4-amino-3-hydroxy-5- 
phenylpentanoic acid. 6-aminohexanoic acid, t-butylglycine. norvaline. phenylglycine. ornithine. 
10 sarcosine. 4.amino-3-bydroxy-6-methylheptanoic acid. 2-thienyl alanine and/or D-isomers of 
amino acids. A Ust of unnatural amino acid, contemplated herein is shown in Table 3. 
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TABLE3 



Non-conventional 


Code 


Non-<H>nventional 


cooe 


amino acid 




amino acid 




a-aminobutyric acid 


Abu 


L-N-methylalanine 


iNmaia 


a-amino-oc-methylbutyrate 


Mgabu 


L-N-methylargininc 


Nmarg 


aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


10 aminoisobutyric acid 


Aib 


I^N-methylcysteine 


Nmcys 


aminonorbomyl- 


Norb 


L-N-methyiglutamine 


Nmgln 


caiboxylate 




L-N-methylglutamic acid 


Nmglu 


cyclohcxylalaninc 




Chcxa L-N-methylhistidine 


Nndus 


cyclopentylalanine 


Cpcn 


LrN-methylisolleucine 


Nmile 


15 D-alanine 


Dal 


LrN-methylleucine 


Nmleu 


D-arginine 


Darg 


L-N-mediyllysine 


Nmlys 


D-aspaitic acid 


Dasp 


L-N-methylmethionine 


Nmmet 


D-cysteine 


Dcys 


L-N-methylnorleucine 


Nmnle 


D'glutamine 


Dgln 


L-N-methyborvaline 


Nmnva 


20 D-glutamic acid 


Dglu 


LrN-methylomithine 


Nmom 


D-histidine 


Dhis 


l^N-methylpbenylalanine 


Nmphe 


D-isoleucinc 


Dile 


L-N-methylproline 


Nmpro 


D-leucine 


Dleu 


L-N-methylserine 


NmsCT 


D-lysine 


Dlys 


L-N-methylthreonine 


iNmtnr 


25 D-methioninc 


Dmet 


L-N-methyltryptophan 


Nmtrp 


D-omithine 


Dom 


L-N-methyltyrosine 


Nmtyr 


D-phenylalanine 


Dphe 


L-N-methylvaline 


Nmval 


D-proline 


Dpro 


L-N-methylethylglycinc 


Nmetg 


]>serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 


30 D-thieonine 


Dthr 


L-norleucine 


Nle 


D-tryptophan 


Dtrp 


L-norvaline 


Nva 



- wo 98/36054 



PCT/AU98/00085 



-23- 



Tv.tvfosine 


Dtyr 


a-mcthyl-aminoisobutyrate 


Maib 


TV.v!i1itlP 


Dval 


a-methyl-y-aminobutyrate 


Mgabu 


n /v.mpthvlalaiiine 


Dmala 


a-methylcyclohexylalanine 


Mchexa 


/V m^^thvlarpininc 


Dmarg 


a-methylcylcopentylalanine 


Mcpen 




Dmasn 


a-methyl-a-nJ5)thylalanine 


Manap 


m<*tVl\/lQCn5irt5ltft 

jL/-u-niciflyi<i2>p<" uiit 


Dmasp 


a-methylpenicillaimne 


Mpen 


U- tt-iiieiny icy j>icnic 


Dmcys 


N-(4-aminobutyl)glycine 


Nglu 


Pv m^tVivlcrliitsiminft 

lj-iX-ni6inyigiuuuiiiii& 


Dmgln 


N-(2-aininoethyl)glycine 


Naeg 


Ll-tt-niClliyiiiiauuJiiC' 


Dmhis 


N-(3-aminopropyI)glycine 


Norn 


1A "P\ /w lYi^thvlfcnlAiicinc 


Dmile 


N-amino-a-methylbutyrate 


Nmaabu 


r\ /v mAtHvll murine 


Dmleu 


a-napthylalanine 


Anap 


U-tt-IuCUiyiiyaiuc 


Dndys 


N-benzylglycine 


Nphe 




Dmmet 


N-(2-caibamylethyl)glycine 


Ngta 


T\ M «vu»t'l^if1nmitliff1P 

j^-tt-nicuiyiuriiiuiiuw 


Dmom 


N-(carbamylinethyl)glycine 


Nasn 


1 c M m^tVii/lrkhmvl alanine 


Dmphe 


N-(2-caiboxyethyl)glycine 


Nglu 


T^/v mptVivlnmlinfi 


Dmpro 


N-(caiboxymethyl)glyciiie 


Nasp 


#v mpflivlcprinp 
u- K-niciiiy 12SC1 uiC' 


Dmser 


N-cyclobutylglycine 


Ncbut 


jL/~" tt-niciiiy luu ciiiuii^ 


Dmthr 


N-cycloheptylglycinc 


Nchep 


r\ r* m/^thvltTvntnnnan 
U-CC-inCUiy lu y pujpiiau 


Dmtrp 


N-cyclohexylglycinc 


Nchex 




Dmty 


N-cyclodecylglycine 


Ncdec 


Fi_ /V -mpth vl val ine 


Dmval 


N-cylcododecylglycine 


Ncdod 


r^-KI mpthvljilaninft 
l^iN ~iucuiy laKUUiit' 


Dnmala 


N-cyclooctylglycine 


Ncoct 


ri-M-methvlarffinine 


Dnmarg 


N-cyclopropylglycine 


Ncpro 


D-N-methvlasDaraeinc 


Dnmasn 


N-cycloundecylglycine 


Ncund 


95 D-N-niethvlasDartate 


Dnmasp 


N-(2.2-diphenylethyl)glycine 


Nbhm 


D-N-methylcysteine 


Dnmcys 


N-(33-diphenylpropyl)glycine 


Nbhe 


D-N-metbylgiutamine 


Dningln 


N-(3-guanidinopropyl)glycine 


Narg 


D-N-methylglutamate 


Dnmglu 


N-(l-hydroxyethyl)glycine 


Nthr 


D-N-methylhistidine 


Dnmhis 


N-(hydrbxyethyl))glycine 


Nser 


30 D-N-methylisoleucine 


Dnmile 


N-(imidazolylethyl))glycine 


Nhis 


D-N-methylleucinc 


Dnmleu 


N-(3-indolyIycthyl)glycine 


Nhtrp 
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D-N-methyllysine 


Dnmlys 


N-methyl-Y-aminobutyrate 


Nmgabu 


N-methylcyclohexylalanine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


D-N-methylomithine 


Dnmom 


N-methylcyclopentylalanine 


Nmcpen 


N-methvlelvcine 


Nala 


D-N-methylphenylalanine 


Dmnphe 


S N-methvlaminoisobutyrate 


Nmaib 


D-N-methylprolinc 


Dnmpro 


N-f 1 -methvlpropyl)glycine 


Nile 


D-N-methylseiine 


Dnmser 


N-(2-methylpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophan 


Dnmtrp 


N-(l-methylethyl)glycine 


Nval 


D-N-methvltvrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 


10 D-N-methvlvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 


Y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


T ^f-hutvlslvcine 

Xi^w \j\»^y ^j^^j 


Tbug 


N-(thiomethyl)glycine 


Ncys 


T ^thvlelvcine 


Etg 


penicillamine 


Pen 


I ^homoDhenvlalanin& 


Hphe 


L-a-methylalanine 


Mala 


15 L^ct-methvlareinine 


Marg 


I^a-mcthylasparagine 


Masn 


L-a-methvlaspartate 


Masp 


I^a-methyl-f-butylglycine 


Mtbug 


L-a-methylcysteine 


Mcys 


L-methylethylglycme 


Metg 


L-a-metfavlslutamine 


Mgln 


l^a-methylglutamate 


Mglu 


L-a-methylhistidine 


Mhis 


L-a-methylhomophenylalanine 


Mhphe 


20 L-a-methylisoleucine 


Mile 


N-(2-methylthioethyl)glycine 


Nmet 


L-a-methylleucine 


Mleu 


L-a-methyllysine 


Mlys 


L-a-methylmethionine 


Mmet 


L-a-methylnorleucine 


Mnle 


L-a-methylnorvaline 


Mnva 


L-a-methylomithine 


Mom 


L-a-methylphenylalanine 


Mphe 


L-a-methylproline 


Mpro 


25 LTa-methylserine 


Mser 


L-a-methylthreonine 


Mthr 


L-a-methyltryptophan 


Mtrp 


L-a-methyltyrosine 


Mtyr 
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L-o-methylvaline Mval 
N-(N-(2,2-diphenylethyl) Nnbhm 
caibamylmethyOglycine 
l-caiboxy-l-(2,2-diphenyl- Nmbc 
5 ethylainino)cyclopropane 



Crosslinkers can be used, for example, to stabiUse 3D conformations, using homo-bifimctional 
crosslinkers such as the bifunctional imido esters having (CH2)n spacer groups with n=l to n=6, 

10 glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctionai reagents which usually 
contain an amino-reactive moiety such as N-hydroxysuccinimide and another group specific- 
reactive moiety such as maleimido or dithio moiety (SH) or caibodiimide (COOH). In addition, 
peptides can be confotmationally constrained by. for example, incorporation of C„ and I^- 
methylamino acids, intnxiuction of double bonds between C„ and Cp atoms of amino acids and 

15 the formation of cyclic peptides or analogues by introducing covalent bonds such as forming 
an amide bond between the N and C termini, between two side chains or between a side chain 
and the N or C terminus. 

These types of modifications may be in^witant to stabilise the proteinase/kinase if administered 
20 to an individual or for use as a diagnostic reagent 

The present invention further contemplates chemical analogues of the proteinase/kinase capable 
of acting as antagonists or agonists of the native molecules or which can act as fiuictional 
analogues of the native molecules. For example, an antagonist may be a proteinase inhibitor. 
25 Chemical analogues may not necessarily be derived from the subject enzymes but may share 
certain conformational similarities. Alternatively, chemical analogues may be specificaUy 
designed to mimic CMtain physiochemical properties of the serine proteinases or kinases. 
Chranical anatogues may be cheimcally synthesised or may be detected following, for example, 
natural product screening. 

30 

The identification of the novel molecules of the present invention permits the generation of a 



L-N-methylhomophenylalanine Nmhphe 
N-(N-(3,3-diphenylpropyl) Nnbhe 
caibamyhnethyl)glydne 
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range of therapeutic molecules capable of modulating expression of their naUve counterparts 
or modulating their activity. Modulators contemplated by the present invention includes 
agonists and antagonists of proteinaseAcinase expression. Antagonists of proteinase/kinase 
expression include antisense molecules, ribozymes and co-suppression molecules. Agonists 
5 include molecules which increase promoter ability or interfere with negative regulatory 
mechanisms. Agonists of proteinase/kinase include molecules which overcome any negative 
regulatory mechanism. Antagonists of tiie proteinase/kinase include antibodies and inhibitor 
peptide fragments. 

10 Otiier derivatives contemplated by the present invention include a range of glycosylation 
variants ftom a completely unglycosylated molecule to a modified glycosylated molecule. 
Altered glycosylation patterns may result fcm expression of recombinant molecules in different 
host cells. 

15 Anotiier embodiment of tiie present invention contemplates a metiiod for modulating 
expression of proteinascAdnase in a human, said metiiod comprising contacting tiie 
proteinaseAdnase gene encoding proteinaseAdnase witii an effective amount of a modulator of 
proteinaseAdnase expression for a time and under conditions sufficient to up-regulate or down- 
regulate or otiierwise modulate expression of proteinaseAdnase. For example, a nucleic acid 

20 molecule encoding proteinaseAdnase or a derivative ttiereof may be introduced into a cell 
conversely, proteinaseAdnase antisense sequences such as oUgonucleotides may be introduced. 

Another aspect of tiie present invention contemplates a metiiod of modulating activity of 
proteinaseAdnase in a human, said metiiod comprising administering to said mammal a 
25 modulating effective amount of a molecule for a time and under conditions sufficient to increase 
or decrease proteinaseAdnase activity. The molecule may be a proteinaceous molecule or a 
chemical entity and may also be a derivative of proteinaseAdnase or its receptor or a chemical 
analogue or truncation mutant of proteinaseAdnase or its recq)tor. 

30 One particularly usefiil serine proteinase. HELA2 (testisin). is implicated in spermatogenesis 
and in testicular tumour devetopnenL It is proposed, in accordance witii tiie present invention. 
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that HELA2 (testisin) is involved in fertility and infertility. 

Northern btot analysis of Poly A+ RNA from normal tissue specimens showed a unique tissue 
distribution for HELA2 (testisin) with significant expression only in tiie testis. No signals are 

5 detectedinanyothertissucwithtiKsexceptionofaminorsignalinsalivary gland. ByRT-PCR. 
HELA2 (testisin) is detected in tiie ejaculate of normal males but not in tiie ejaculate of 
vasectomised males indicating tiat it is of germ cell origin. Hybridization data in situ indicated 
tiiat HELA2 (testisin) is produced by immature germ cells in the testis, located near the basal 
epitiielium and. hence, is an important factor for normal sperm maturation; defective expression 

10 ormutationswouWcontribute to primary male infertiUty. Further, it is from the precursors of 
spermatocytes tiiat 95% of testicular germ cell tumours, such as seminomas, embryonal 
carcmomas and teratocarcinomas arise. In die normal testis, germ ceUs undergo meiosis to 
become spermatocytes, but in individuals at risk, the germ cells continue to proUferate giving 
rise to germcen tumours. Although not wishing to limit the present invention to any one theory 

15 or mode of action, it is proposed, in accordance with present invention, tiiat HELA2 (testisin) 
functions at this criticaljuncture-ceU growth versus maturation. 

Familial forms of testicular cancer are rare, but linkage analysis of a large family witii famOial 
seminoma has demonstrated linkage to chromosome 16p, within a region adjacent to tiie 
20 HPKDl (human polycystic kidney disease) gene at 16pl3.3. The HELA2 (testisin) gene 
localises to chromosome 16pl3.3 which is near tiie telomere of chromosome 16 and is 
associated witii high genetic instability. The HELA2 (testisin) gene is sandwiched between four 
genes which underlie otiier human genetic disorders; HPKDl and tuberous sclerosis (TSC2) 
on the one side, and familial mediterranean fever (MEF) and Rubenstein-Taybi syndrome 

25 (RSTS) on tiie otiier side. The question of whetiier HELA2 (testisin) may be a tumour 
suppressor for seminoma was determined by comparing HELA2 (testisin) mRN A expression 
in normal testes witti corresponding germ cell tumours from patients witii seminoma. HELA2 
(testisin) was not detectable in tiie tumours of tiiese patients, but was present in tiie 
corresponding nonnal testis specunens. indicative of a tumour suppressor role of HELA2 

30 (testisin) in testicular germ cell cancers. 
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Although restricted in nonnal tissues to the testes. HELA2 (testisin) is expressed in tumours 
of the colon, pancreas, prostate and ovary. This indicates that HELA2 (testisin) contributed 
to tumourigenesis and, therefore, has an application as a marker and also as a therapeutic anti- 
tumour target in these types of cancers. 

5 

These data point to a potentially very significant role for HELA2 (testisin) in testicular germ 
cell maturation (spermatogenesis) as weU as in the genesis of testicular germ ceU tumours. In 
accordance witii tiie present invention, it is proposed tiiat expression of HELA2 (testisin) by 
immature germ ceUs may be essential for sperm ceU development, such that loss of HELA2 
10 (testisin) expression leads to continued and uncontrolled proliferation of immature germ cells 
leading to subsequent tumourigenesis. Germ cells wherein HELA2 (testisin) is mutated or 
absent may tiius be prone to malignant transformation because of an inabihty to progress along 
the differentiation pathway. 

15 HELA2 (testisin) is weU-positioned to anchor on the surface of the germ cell where it would 
participate in a range of proteolytic activities, including ceU migration, differentiation and/or 
activation of growtii factors, receptors, or cytokines as weU as initiate additional proteolytic 
cascades. Altiiough not intending to limit die present invention to any one theory or mode of 
action, it is proposed, in accordance witti die present invention, tiiat die proteolytic target of 

20 HELA2 (testisin) is a cytokine, receptor or growdi factor essential for eitiier germ cell 
proliferation or differentiation - ie. HELA2 (testisin) may eitiwr mactivate a factor important 
for proliferation, or activate a factor which promotes differentiation. Thus, HELA2 (testisin) 
may be critical in die regulation of specific cytokmes. cytokine receptors or growth factors by 
means of post-translational proteolytic processing. That HELA2 (testisin) is not present in 

25 other nonnal tissues of the male urogenital tract, such as tiie prostate and kidney, also argues 
for such a role specific to the testis. 

Diagnostic and therapeutic applications for HELA2 (testisin) have tiie potential to be wide- 
ranging bodi in die cancer and fertility/infertility markets. In tumours, otiier tiian die testis, it 
30 is desirable to block or inhibit HELA2 (testisin) activity. As HELA2 (testism) is a member of 
die serine proteinase fiunily, for which prototype crystal structures arc known and die catalytic 
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roechanism reasonably weD characterised, the design of drugs that target HELA2 (testisin) 
proteolytfcactivityasanant^tumourtherapyshouldberelativety AsHELA2 
(testisin) is predicted to be anchored on the ceU surface, there would not be difficulties 
associated withdeUveryofdrugstointracellularoompartments. Further, it is very possible that 
5 son* tumour-associated HELA2 (testisin) may be proteolyticaUy cleaved fiom the surface of 
tumour cells, and the extracellular domam detectable in patient serum as a potential tumour 
associated marker. 

Testicular cancer is the commonest malignancy in men aged 2(M4 years. Early diagnosis 
10 correlates which an improved chance of cure and in a reduction in the severity of treatment. 
If the cancer is not treated early, it becomes very aggressive. The incidence of testicular cancer 
is significant (9/100.000) and has been rising over the last 10 years. In testicular germ cell 
tumours, such as seminoma, deUveiy of recombinant HELA2 (testisin) using gene therapy 
techniques could lead to ancst of tumour growth and potentiaUy allow commencement of 
15 norinalspetmcenmaturationanddiferentiation.thercbyreducingthenee^ 

of the testis (orchidectomy). Tliis may be particularly effective for patients who have aheady 
had one testicle removedbecauseoftesticularcancer. The risk ofcontralateral testicular cancer 
is increased in these patients and tumour development could be arrested through early treatment 
with HELA2 (testisin) to arrest growth and assist maturation of germ cells. The finding of 
20 mutant forms of HELA2 (testisin) may also lead to new maricers for seminoma. Unlike other 
testicular non-seminoma cancers where a-fetoprqtein and P-HCG are fiequendy elevated and 
can be used as tumour markers, the lack of an adequate maricer for seminoma creates difficulties 
with staging and patient follow-up. 

25 A demonstrated role for HELA2 (testisin) in sperm maturation and development would likely 
lead to improved diagnosb and new duected therapeutics for male primary infertility. Primary 
male infertiUty is responsible for conception problems in 5-10% of couples and tiie world 
market for a therapeutic in this area would be veiy substantial. Delivery of recombinant 
HELA2 (testisin) could assist sperm maturation and potentially trigger normal sperm 

30 development in some of these cases. The identification of mutant forms of HELA2 (testisin) 
could aid in diagnosis of infertility. If HELA2 (testisin) does not prove to be a tiimour 
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suppressor. but is important for sperm maturation, it could provide a new target for the 
development of a male contraceptive. If hormonal regulation of HELA2 (testisin) can be 
demonstrated. HELA2 (testisin) may prove effective for tfie ti^atment of conditions arising 
fiom dysfunctional hoimal responses, such as cryptorchidism, which is associated witii botii 
5 infertility ami seminoma development. 

Accoidingly, the present invention contemplates a pharmaceutical composition coiiqjrising 
proteinaseAcinase or a derivative tiiereof or a modulator of proteinasefldnase expression or 
proteinasetoiase activity and one or more pharmaceutically acceptable carriers and/or diluents. 
10 These components are referred to as die "active ingredients" and include, for example, HELA2 
(testisin). 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions (where 
water soluble) and sterile powders for tfie extemporaneous preparation of sterile mjectable 

15 solutions. It must be stable under the conditions of manufacuire and storage and must be 
preserved against tiie contaminating action of microorganisms such as bacteria and fiingi. The 
carrier can be a solvent or dispersion medium containing, for example, water, etimnol. polyol 
(for example, glycerol, propylene glycol and liquid polyetiiylene glycol, and the like), suitable 
mixtures ttieieof; and vegetable oils. The proper fluidity can be maintained, for example, by tiie 

20 use of a coating such as lecidiin, by tiie maintenance of the required particle size in die case of 
dispersion and by tiie use of superfactants. The preventions of tiie action of microorganisms 
can be brought about by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimersal and tiie like. In many cases, it wUl be preferable 
to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of 

25 the injecteble compositions can be brought about by the use in the compositions of agents 
delaying absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating die active compounds in tiie required 
an»unt in tiie appropriate solvent witii various of tiie otiier ingredients enumerated above, as 
30 required, foltowed by filtered steriBzatioa Generally, dispersions arc prepared by incorporating 
tiK various sterilized active ingredients into a sterile vehwle which contains tiie basic dispersion 
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medium and the required other ingredients fiom those enumerated above. In the case of sterile 
powders for the preparation of sterile injectable solutions, the preferred methods of preparation 
are vacuum drying and the fieeze-drying technique which yield a powder of the active 
ingredient plus any additional desired ingredient fiom previously sterile-filtered solution thereof. 



5 



When the active ingredients aie suitably protected tbey may be orally administered, for example, 
with an inert diluent or with an assimilable edible carrier, or it may be enclosed in hard or soft 
shell gelatin capsule, or it may be compressed into tablets, or it may be incorporated directiy 
with the food of tiie diet. For oral therapeutic administration, the active compound may be 
10 incoiporated with excipients and used in tiie form of ingestible tablets, buccal tablets, troches, 
capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and preparations 
should contain at least 1% by weight of active compound. The percentage of die compositions 
and preparations may, of course, be varied and may convenientiy be between about 5 to about 
80%oftheweightoftheunit Hieamountof active compound in such therapeutically useful 
15 compositions in such that a suitable dosage will be obtained. Preferred compositions or 
preparations according to the present invention are prepared so that an oral dosage unit form 
contains between about 0. 1 A^g and 2000 mg of active compound. 

The tablets, troches, pills, capsules and the like may also contain the components as listed 
20 hereafter. A binder such as gum, acacia, com starch or gelatin; excipients such as dicalcium 
phosphate; a disintegrating agent such as com starch, potato starch, alginic acid and tiie like; 
a lubricant such as magnesium stearate; and a sweetening agent such a sucrose, lactose or 
saccharin may be added or a flavouring agent such as peppermint, oil of wintergreen. or cherry 
flavouring. When tije dosage unit form is a capsule, it may contain, in addition to materials of 
25 the above type, a liquid carrier. Various other materials may be present as coatings or to 
otherwise modify the physical form of the dosage unit. For instance, tablets, piUs, or capsules 
may be coated with shellac, sugar or both. A syrap or eUxir may contain die active compound, 
sucrose as a sweetening agent, methyl and propylparabens as preservatives, a dye and 
flavouring such as cherry or orange flavour. Of course, any material used in preparing any 
30 dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts 
employed. In addition, tiie active compound(s) may be incorporated into sustained-release 
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preparations and formulations. 

Pharnmceutically acceptable carriers and/or diluents include any and all solvents, dispersion 
roedia, coatings, antibacterial and antifimgal agents, isotonic and absorption delaying agents and 
5 thelike. Theuseofsuchn^liaandagentsforpharinaceuticalactivesubstancesisweUknown 
in the art. Except insofar as any conventional media or agent is incompatible with the active 
ir,gredient. use thereof in the therapeutic compositions is contemplated. Supplementary active 
ingredients can also be incorporated into the compositions. 

10 It is especiaUy advantageous to formulate parenteral compositions in dosage unit fonn for ease 
of administration and uniformity of dosage. Parental compositions are generally suitable for 
administration by the intravenous, subcutaneous or intramuscular routes amongst other routes 
of administration. Dosage unit form as used herein refers to physically discrete units suited as 
unitarydosagesforthemammaliansubjectstobetreated; each unit containing a predetermined 

15 quantity of active material calculated to produce the desired therapeutic effect in association 
with the required pharmaceutical carrier. The specification for the novel dosage unit forms of 
the invention are dictated by and direcdy dependent on (a) the unique characteristics of the 
active material and the particular therapeutic effect to be achieved, and (b) the limitations 
inherent in the art of compounding such an active material for the treatment of disease in living 

20 subjects having a diseased condition in which bodily health is impaired as herein disclosed in 
detail. Other forms of administration include but are not limited to intranasal, buccal, rectal, 
suppository, inhalation, intracerebral and intraperitoneal. 

The principal active ingredient is compounded for convenient and effective administration in 
25 effective amounts with a suitable pharmaceutically acceptable carrier in dosage unit form as 
hereinbefore disclosed. A unit dosage form can. for example, contain the principal active 
compound in amounts ranging from 0.5 Mg to about 2000 mg. Expressed in proportions, the 
active compound is generaUy present in from about 0.5 jig to about 2000 mg/ml of carrier. In 
the case of compositions containing supplementary active ingredients, the dosages are 
30 determinedbyrcferencetotheusualdoseandmannerofadniinistrationofthesaidin^ 
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The effective amounts include amounts calculated or predicted to have the desired effect and 
range fiom at least about 0.01 ng/kg body weight to about 10,000 mg/kg body weight. 
Alternative amounts include 0.1 ngflcg body weight to about 1000 ng/kg body weight 

5 The pharmaceutical composition may also comprise genetic molecules such as a vector capable 
of transfecting target cells where the vector carries a nucleic acid molecule capable of 
modulating proteinase/kinase expression or proteinaseAdnase activity. The vector may, for 
example, be a viral vector. This form of therapy is proposed to be particularly useful for gene 
replacement or enhancement therapy for HELA2 (testisin) especially for the modulation of 

10 fertility and/or treatment of testicular cancer. 

StUl another aspect of the present invention is directed to antibodies to proteinase/kinase and 
its derivatives. Such antibodies may be monoclonal or polyclonal and may be selected fiom 
naturally occurring antibodies to proteinase/kinase or may be specifically raised to 

15 proteinasetose or derivatives thereof. In the case of the latter, proteinase/kinase or its 
derivatives may first need to be associated with a carrier molecule. The antibodies and/or 
recombinant proteinascfldnase or its derivatives of the presem invention are particularly usefiil 
as therapeutic or diagnostic agents. For example, monitoring non-testicular cancer by 
measuring HELA2 (testisin) or screening for tiie presence of testicular cancer by an absence of 

20 HELA2 (testisin). 

Proteinase/kinase and its derivatives may also be used to screen for naturally occurring 
antibodies to proteinasetoiase. These may occur, for example in some autoimmune diseases. 
Ahematively. specific antibodies can be used to screen for proteinaseAdnase. Techniques for 
25 such assays are weU known in tiie art and include, for example, sandwich assays and ELISA. 
Knowledge of proteinaseflcinase levels may be important for diagnosis of certain cancers or a 
predisposition to cancers or for monitoring certain tiierapeutic protocols. 

Antibodies the proteinaseAdnase of tiie present invention may be monoclonal or polyclonal. 
30 Alternatively, ftagments of antibodies may be used such as Fab fragments. Furthermore, tiie 
present invention extends to recombinant and synthetic antibodies and to antibody hybrids. A 
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"synthetic antibody" is considered herein to include firagments and hybrids of antibodies. The 
antibodies of this aspect of the present invention are particularly useful for immunotherapy and 
may also be used as a diagnostic tool for assessing apoptosis or monitoring die program of a 
tiierapeutic regimen. 

5 

FoT example, specific antilwdies can be used to screen for proteinase/kinase proteins. The latter 
would be important, for example, as a means for screening for levels of proteinase/kinase in a 
ceU extract or other biological fluid or purifying proteinaseMMse made by recombinant means 
from culture supernatant fluid. Techniques for the assays contemplated herein arc known in die 
10 art and include, for example, sandwich assays and ELISA. 



It is within the scope of this invention to include any second antibodies (monoclonal, polyclonal 
or fragments of antibodies or synthetic antibodies) directed to die first mentioned antibodies 
discussed above. Botii die first and second antibodies may be used in detection assays or a first 
15 antibody may be used witii a commeiciaUy available anti-immunoglobulin antibody. An 
antibody as contemplated herein includes any antibody specific to any region of 
proteinase/kinase. 

Both polyclonal and monoclonal antibodies are obtainable by immunization witii die enzyme 
20 or protein and either type is utilizable for immunoassays. The meUiods of obtaining both types 
of sera are weU known in the art Polyclonal sera are less preferred but arc relatively easUy 
prepared by injection of a suitable laboratory animal witii an effective amount of 
proteinase/kinase, or antigenic parts tiiereof, collecting serum from the animal, and isolating 
specific sera by any of die known immunoadsorbent techniques. Altiiough antibodies produced 
25 by this method are utilizable in virtually any type of immunoassay, tiiey are generaUy less 
favoured because of the potential heterogeneity of the product. 

The use of monoclonal antibodies in an immunoassay is particularly preferred because of die 
ability to produce tiiem in large quantities and die homogeneity of die product. The preparation 
30 of Iq/taidoma cell lines for monoclonal antibody production derived by fusing an immortal ceU 
line and lymphocytes sensitized against tiie immunogenic preparation can be done by techniques 
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which are well known to those who are skilled in the art. 

Another aspect of the present invention contemplates a method for detecting proteinase/kinase 
in a biological sample fiom a subject said method comprising contacting said biological sample 
5 with an antibody specific for proteinase/kinase or its derivatives or homologues for a time and 
under condiUons sufficient for an antibody-proteinase/kinase complex to form, and then 
detecting said complex. 

The presence of proteinase/kinase may be accomplished in a number of ways such as by 
10 Western blotting and ELISA procedures. A wide range of immunoassay techniques are 
available as can be seen by reference to US Patent Nos. 4,016.043, 4. 424,279 and 4.018,653. 
These, of course, includes both single^site and two-site or "sandwich" assays of the non- 
competitive types, as weU as in the traditional competitive binding assays. These assays also 
include direct binding of a labelled antibody to a target 

15 

Sandwich assays are among tiie most useful and commonly used assays and are fevoured for 
use in tiie present invention. A number of variations of the sandwich assay technique exist, and 
all are intended to be encompassed by the present invention. Briefly, in a typical forward assay, 
an unlabelled antibody is immobilized on a solid substrate and the sample to be tested brought 

20 into contact with the bound molecule. After a suitabte period of incubation, for a period of time 
sufficient to allow formation of an antibody-antigen complex, a second antibody specific to the 
antigen, labelled with a reporter molecule capable of producing a detectable signal is then added 
and incubated, allowing time sufficient for tiie formation of anotiier complex of antibody- 
antigen-labelled antibody. Any unteacted material is washed away, and tiie presence of tiie 

25 antigenisdeterminedbyobservationofasignalproducedbythereportermolecule. Theresults 

may eitiier be qualitative, by simple observation of tiie visible signal, or may be quantitated by 
comparing with a control sample containing known amounts of hapten. Variations on tiie 
forward assay include a simultaneous assay, in which botii sample and labelled antibody are 
added simultaneously to the bound antibody. These techniques arc well known to tiiose skilled 
30 in tiie art, including any minor variations as will be readily apparent In accordance witii tiie 
present invention tiie sample is one whk;h might contain proteinase/kinase including cell extract 
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tissue biopsy or possibly serum. saUva, mucosal secretions, lymph, tissue fluid and respiratory 
fluid. The sample is, therefore, generaUy a biological sample comprising biological fluid but 
also extends to fermentation fluid and supernatant fluid such as from a ceU culture. 

5 In the typical forward sandwich assay, a first antibody having specificity for the 
proteinaseMnase or antigenic parts thereof, is either covalently or passively bound to a solid 
surface. The solid surface is typically glass or a polymer, the most commonly used polymers 
being cellulose, polyaciylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. The 
solid supports may be in the form of tubes, beads, discs of microplates, or any other surface 

10 suitable for conducting an immunoassay. The binding processes are well-known in the art and 
generaUy consist of cross-linking covalenUy bindmg or physically adsorbing, the polymer- 
antibody conptex is washed in preparation for the test sample. An aliquot of the sample to be 
tested is then added to the solid phase complex and incubated for a period of time sufficient 
(e.g. 2-40 minutes or overnight if more convenient) and under suitable conditions (e.g. from 

1 5 about room temperature to about YTQ to altow binding of any subunit present in the antibody. 
FoUowing the incubation period, the antibody subunit solid phase is washed and dried and 
incubated with a second antibody specific for a portion of the hapten. The second antibody is 
linked to a reporter molecule which is used to indicate the binding of tiie second antibody to the 
hapten. 

20 

An akemative method involves immobilizing the target molecules in the biological sample and 
then exposing the immobilized target to specific antibody which may or may not be labelled 
witii a reporter molecule. Depending on the amount of target and the strength of the reporter 
molecule signal, a bound target may be detectable by direct labelling with the antibody. 
25 Alternatively, a second labelled antibody, specific to tiie first antibody is exposed to the target- 
first antibody complex to form a target-first antibody-second antibody tertiary complex. The 
complex is detected by the signal emitted by the reporter molecule. 

By "reporter molecule" as used ui the present specification, is meant a molecule which, by its 
30 chemical nature, provides an analyticaUy identifiable signal which allows tiie detection of 
antigen-bound antibody. Detection may be either quaUtative or quantitative. The most 
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conunonly used reporter molecules in this type of assay are either enzymes, fluorophores or 
radionucleotide containing molecules (i.e. radioisotopes) and chemUuminescent molecules. 
In the case of an enzyme immunoassay, an enzyme is conjugated to the second antibody, 
generally by means of glutaraldehyde or periodate. As will be readUy recognized, however, a 

5 wide variety of different conjugation techniques exist, which are readily available to the skUled 
artisan. Commonly used enzymes include horseradish peroxidase, glucose oxidase, beta- 
galactosidase and alkaline phosphatase, amongst others. The substrates to be used with the 
specific enzymes are generally chosen for the production, upon hydrolysis by the corresponding 
enzyme, of a detectable colour change. Examples of suitable enzymes include alkaline 

10 phosphatase and peroxidase. It is also possible to employ fluorogenic substrates, which yield 
a fluorescent product rather than the chromogenic substrates noted above. In aU cases, the 
enzyme-labelled antibody is added to the first antibody hapten complex, allowed to bind, and 
then ttK excess reagent is washed away. A solution containing the appropriate substrate is then 
added to die complex of antibody-antigen-antibody. The substrate will react with tiie enzyme 

15 linked to tiie second antibody, giving a qualitative visual signal, which may be further 
quantitated. usually spectrophotometrically. to give an indkation of tiie amount of hapten which 
was presem in die sample. "Reporter noolecule" also extends to use of cell agglutination or 
inhibition of agglutination such as red blood ceUs on latex beads, and die like. 

20 Alternately, fluorescent compounds, such as fluorescein and thodamine. may be chemically 
coupled to antibodies without altering tiieir binding capacity. When activated by illumination 
with Ught of a particular wavelengtii. die fluorochrome-labelled antibody adsorbs tiic Ught 
energy, inducing a state to excitabUity in the molecule, followed by emission of die Ught at a 
characteristic colour visuaUy detectable with a light microscope. As in the EIA. the fluorescent 

25 labelled antibody is aUowed to bind to tiK first antibody-hapten complex. After washing off the 
unbound reagent, the remaining tertiary complex is tiien exposed to the light of the appropriate 
wavelengtii tiie fluorescence observed indicates die presence of tiie hapten of interest. 
Immunofluorescene and EIA techniques are botii very well estabUshed in die art and are 
particularly preferred for ttie present metiiod. However, otiier reporter molecules, such as 

30 radioisotope, chemiluminescent or bioluminescent molecules, may also be employed. 
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The present invention also contemplates genetic assays such as involving PGR analysis to detect 
proteinasetose gene or its derivatives. Alternative mettiods or metijods used in conjunction 
include direct nucleotide sequencing or mutation scanning such as single stranded conformation 
polymorphonis analysis (SSCP), specific oUgonucleotide hybridisation, and metiiods such as 
5 direct protein truncation tests. 

The nucleic acid molecules of the present invention may be DNA or RNA. When tiie nucleic 
add molecule is in DNA form, it may be genomic DNA or cDNA. RNA forms of tiie nucleic 
acid molecules of die present invention are generally mRNA. 

10 

AMwugh tiie nucleic acid molecules of tiie present invention are generally in isolated form, they 
may be integrated into or ligated to or otiierwise fused or associated witii otiier genetic 
rnokculessuchasvectormoleculesandinparticularexpressionvectormolecules. Vectorsand 
expression vectors are generally capable of repUcation and. if applicable, expression in one or 
15 botti of a prokaryotic cell or a eukaryotic cell. Preferably, prokaryotic cells include K coli, 
BacaiusspaDdPseudomonassp. Preferred eukaryotic cells include yeast, fimgal, mammalian 

and insect cells. 

Accordbgly, another aspect of the present invention contemplates a genetic construct 
20 comprising a vector portion and a mammalian and more particularly a human proteinase/kinase 
gene portion, which proteinaseMiase gene portion is capable of encoding an proteinase/kinase 
polypeptide or a functional or immunologically interactive derivative diereof. 

Preferably, tiie protemaseAdnase gene portion of tiie genetic construct is operably linked to a 
25 promoter on the vector such tiiat said promoter is capable of directing expression of said 
proteinaseAdnase gene portion in an appropriate cell. 

In addition, tiie proteinaseAdnase gene portion of tiie genetic construct may comprise aU or part 
of the gene fused to anotiier genetic sequence such as a nucleotide sequence encoding 
30 glutatiiione-S-transferase or part thereof. 



- ^098/36054 



PCT/AU98/00085 



-39- 



The present invention extends to such genetic constructs and to prokaryotic or eukaryotic ceUs 
comprising same. 

The present invention also extends to any or all derivatives of proteinase^ase including 
5 mutants, part, fragments, portions, homologues and analogues or their encoding genetic 
sequence including single or multiple nucleotide or amino acid substitutions, additions and/or 
deletions to tiie naturaUy occurring nucleotide or amino acid sequence. The present invention 
further encompasses hybrids between the proteinase/kinases such as to broaden the spectrum 
of activity and to ligands and substrates of tiie proteinase/kinase. 

10 

The proteinase/kinase and its genetic sequence of the present invention will be useful in tiie 
generation of a range of therapeutic and diagnostic reagents. 

Soluble proteinase/kinase polypeptides or otiier derivatives, agonists or antagonists are also 
15 contemplated to be usefiil in tiie treatment of disease, injury or abnormality in the nervous 
system. c.g. in relation to central or peripheral nervous system to treat Cerebral Palsy, trauma 
induced paralysis, vascular ischaemia associated with stroke, neuronal tumours, motoneurons 
disease. Paridnson's disease. Huntington's disease, Alzheimer's disease. Multiple Sclerosis, 
peripheral neuropatiiies associated with diabetes, heavy metal or alcohol toxicity, renal failure 
20 andinfi5ctiousdiseasessuchasherpes.rubella,measles, chicken POX.HW or HTL Otiier 
conditions for which tiie proteinasefldnase are usefiil include cancer, metastasis and autoimmune 
disease amongst many oti^rs. Particular appUcations for HELA2(testisin) include as a marker 
for non-testicular cancers, in the treatment of testicular cancer and in flie treatment of infertility 
or in inducing infertility such for contraception. 

25 

A further aspect of tiie present invention contemplates tiie use of proteinasetose or its 
fiinctional derivatives in tfie manufecture of a medicament for tiie treatment of proteinaseflcinase 
mediated conditions defective or deficient. 

30 The present invention is further described by tiie foUowing non-limiting Examples. 
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EXAMPLEl 
aX>NING PR0C3EDURES 

In order to identify serine proteinases that may be involved in regulatory ceUular functions, a 
5 genetic screening approach was applied using degenerate primers corresponding to conserved 
regions of serine proteinases (amino acids flanking His- and Ser- residues) to anq)lify gene 
fragments spanning these regions from cDNA. using a low stringency RT-PCR (Reverse 
Transcriptase-Polymerase Chain Reaction) approach. 

10 By this technique, the aim was to isolate low abundance genes as well as those present in 
moderate to high abundance. The cDNA used for these experiments was isolated from a HeLa 
cellcytotoxicily model wherein PAI-2 expression inhibits TNF(-induced apoptosis (Dickinson 
et al J. Biol Chem. 270: 27894-27904. 1995). These PAI-2 expressing cells provide a unique 
and viable system for investigating 1NF(signamng pathways as they are protected from the 

15 cytotoxic effects of TNF). 

cDN A was generated from RNA isolated from HeLa cells and PAI-2 expressing HeLa ceUs, 
both untreated and following treatment with TNF and cycloheximide. Amplification of each 
cDNA population using PGR and the following serine proteinase degenerate primers, 

20 

His Primer: 5'ACAGAATTCTGGGTIGTIAqGCIGaCAYTG3' [SEQ ID NO: 1], 
Ser Primer: 5"ACAGAATrCAXIGGICacaaGT/AXTaCX:3' [SEQ ID NO:2] 

(where X= A or G; Y= C or T; 1= Inosine) 

25 produced DNA fragments in the range of 480bp, the approximate predicted size of the serine 
proteim.se intergenic region. These amplified DNA fragments were cloned into E. coli 
generating a library containing approximately 150 independent clones. The inventors analysed 
36 of these clones and found that 9 encoded previously identified serine protemases or 
tissue-type or urokinase-type.plasminogen activators, thereby demonstrating the efficacy of this 

30 approach. Of the other 36. two were found to encode novel open reading frames with high 
homology to serine proteinases and are referred to herein as "HELA2" (or "testisin") and 
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-ATC2". One additional clone designated herein. "BCON3". showed homology to a kinase. 
Extenston of the DNA fragments by RACE in both 5' and 3" directions using internally derived 
primershas verified thehomologyofHEI^2 and ATC2 to the serine proteinase f^^^^ Each 
of the three DNA sequences are unique in that they are markedly different from any known 
5 DNA or protein sequence in the Genbank and NBRF databases. 

EXAMPLE 2 
HELA2 SERINE PROTEINASE (TESTISIN) 

10 The HELA2 mRNA transcript is approximately 1.5kb as determined from Northern blot 
analysis. Nucleic acid sequence was obtained for about 1 . 1 kb of HELA2 which spans the entire 
coding region, the 3" noncoding region and part of the 5" noncoding region. The coding region 
starts with an ATG codon which is prt«ent in a moUf analogous to the Kozak eukaryotic 
translation initiation consensus sequence. Alignment of the deduced amino acid sequence of 

1 5 HELA2 widi homologous serine proteinases shows that the cDN A encodes a 3 14 amino acid 
(aa) polypeptide with a calculated molecular weight of 34.8kD (called Testisin). which is 
synthesized as a zymogen containing pre-, pro- and catalytic regions (Figure 1 ). The pro- region 
(or light chain) and tiie catalytic region (heavy chain) are delineated by a classic serine 
proteinase activation motif Arg-Ile-Val-Gly-Gly [SEQ ID NO:24] with cleavage likely 

20 occurringbetweenArgandlle. TIk catalytic region includes tiie catalytic triad of His. Asp and 
Ser in positions and motifs which are highly conserved among tiie serine proteinases. Ten Cys 
residues occur in conserved positions: by analogy to otiier serine proteinases, eight of these 
function to form disulfide bridges vdthin die catalytic region and ihc remaining two Unk die pro- 
and catalytic regions. 

25 

Structural features conserved in tiie binding pockets of serine proteinases are present in HELA2 
(testisin). An Asp residue at the bottom of the serine proteinase binding pocket sk residues 
before die active site Ser in HELA2 (testisin) indicates tiiat HELA2 (testisin) has trypsin-like 
specificity, witii proteolytic cleavage after Arg or Lys in target substrates. HELA2 (testisin) also 
30 contains a conserved Ser-Trp<}ly motif at tiie top of tiie binding pocket which is likely involved 
in hydrogen bonding witii target substrates in otiier serine proteinases. 
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A hydrophobicity plot of the HELA2 (testisin) amino acid sequence (Figure 1) identifies two 
hydrophobic regions, one located at the amino terminus and the other at the carboxy terminus. 
Tie 20 aa amino terminal hydrophobic region is Bkriy to be a signal peptide, which would direct 
newly synthesized HELA2 to enter the endoplasmic reticulum. The 16 aa hydrophobic caiboxy 

5 terminus of HELA2 (testisin) shows high homology to the transmembrane domain of prostasin 
(Figure 2), suggesting that HELA2 (testisin) is likely to be a membrane-anchored serine 
proteinase. Thus HELA2 (testisin) may anchor on the germ ceU surface where it could 
participate in a range of proteolytic activities, including participation in cell migration, 
differentiation and/or activation of growth factors and proteolytic cascades. In prostasin. diis 

10 protruding caiboxy terminus may be cleaved, thus releasing the serine proteinase from the 
membrane. A simUar cleavage event may also occur with Testasin. 

Two isoforms of HELA2 were identified in a HeLa cell cDNA library (Stratagene UniZap HeLa 
Library) which difer by an insertion of 6 nucleotides which generates a Sfi 1 restriction enzyme 
15 site. At the protein level, there is a corresponding insertion of 2 aa's (Tyr-Ser) within the 
catalytic binding pocket (Rgure 2A). The two isoforms of HELA2 cDN A are referred to as the 
short (S) and long (L) isoforms. respectively. The nucleotide and corresponding amino acid 
sequence for the short isoform of HELA2 is shown in SEQ ID NOs. 3 and 4. respectively. The 
long isoform is shown in SEQ ID N0:5 and 6. respectively. 

20 

EXAMPLES 

GENERATION OF FULL LENGTH cDNA ENCODING HELA2 CTESTISIN) 

Partial cDNA fragments of the short and long isoforms of HELA2 were obtained using a 
25 combinationoflibraryscieeningtechniques. Plasmids containing the full length cDNA of the 
two isoforms were then generated in pBluescriptSK(-) by ligating restriction enzyme-digested 
fragments of the partial cDNAs. A plasmid map of tiie two generated constructs. 
pBluescriptHELA2(S) and pBluescriptHELA2(L). and a restriction enzyme map of tiie long 
isoform cDNA are diagrammed in Figure 3. 

30 

In vitro tiMscriptionAranslation using HELA2 cDNA shows a major specific product of 
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approximatdy 35kD (Fig. 2B). which is the same as size predicted from the open reading frame, 
demonstrating that HELA2 cDNA encodes a protein. The translationAranscription coupled 
rabbit reticulocyte lysate system (Promega) was used as per the manufacturer's instructions for 
35S-methionine labelling. Clones of HELA2 in pBluesciipt a PAI-2 positive control were used 
5 with T3-RNApolymMase (sense direction), 

EXAMPLE 4 

EXPRESSION OF RECOMBINANT HELA2 (TESTISIN) IN E.COLI 

10 (A) Generation of expression constructs 

(i) His(6)-tagged recombinant HELA2 (testisin) 
To reduce potential toxic effects on host cells, and therefore optimise expression, a strategy was 
employed to eliminate the hydrophobic residues of the secretory and membrane anchoring 
domains of HELA2 (testisin) CTestisin (20-295)). Testisin (20-295) fragments which were His6 

15 taggedateithertheaminoorcarboxyterminalwereobtainedbyPCRandexpressionco^^ 

were generated by inserting these into pQE vectors (Qiagen). 

The primers used to generate the amino-terminal tagged protein were: 
forward: 5' GCACAGTCGACCAAGCCGGAGTCGCAGAG 3' [SEQ ID NO: 1 1] and 
20 reverse: 5'GCACAAAGdTGCCAGGAGGGGTCTGGCrG3' [SEQ ID N0:12] 

The amplification pioduct of 858bp was digested with SaD and HindHI and ligated into pQE-10 
to give pQE-10(20-295)N (Figure 4). 

The primers used to generate the carboxy-terminal tagged protein were: 
25 forward: 5' GCACAACCATGGCCAAGCCGGAGTCGCAGGAG 3' [SEQ ID NO:13] and 
reverse 5' GCACAAGATCTCCAGGAGGGGTCTGGCTG 3' [SEQ IDNO:141. 
The amplification pioduct of 859 bp was digested with Ncol and Bgffl and ligated into PQE-60 
to give pQE-60(20-295)C (Figure 4). 

30 (ii) GST-tagged recombinant HELA2 (testisin) 

In order to generate a fiision of glutathione-S-transfeiase (GST) and HELA2 (testisin). 
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pBluescriptHELA2(S) was digested with Sau3Al. leleasing a 570bp DNA fragment encoding 
the iWaminoacidsatthecaiboxyterininaiendof HEI^(testisin). This DNA fragment was 
doned into the BamHl site of pGEX-1 generating pGEX-l(90-279) (Figure 4) and subjected 
to DNA sequence analysis to confinn that the fiision was in frame. 

5 

(b) Expression of His-tagged HELA2 (testisin) in E. coli 

pQE10(20-295)N and pQE60(20-295)C plasmids were electro-transfoimed into E. coli DH5( 
cells. Four different clones were selected for further analysis: ffis-N21 expressing amino 

10 terminal His6-tagged Testisin (20-295); and His-C21. ffis-C22, and His-C23 expressing 
carboxy terminal His6-tagged Testisin (20-295). To express recombinant HELA2 (testisin) 
protein, transformed cells were grown to log phase then induced for 4 hours in the presence of 
2mM IPTO. Cfclls were lysed in a denaturing lysis buffer containing 8M urea. O.IM NaH2P04 
and 0.01M TrismCI pH8. Alternatively the ceUs were lysed in a non-denaturing lysis buffer 

15 containing 0.1MNaH2PO4.0.1MNaa and 0.01 MTrisAHapHS. The ffis6 tagged protein 
was recovered by mixing the lysate with a metal affinity resin (Qiagen or Clontech). Purified 
testisin(L) was eluted with 100 mM EDTA in lysis buffer (pH 6.3). A major band of 
approximately 32 kDa was obtained in the eluate as shown by the arrows in Figure 5A. 
Western blot analysis of a purification of the His^3 clone using an anti-His6 antibody showed 

20 that the band at 32 kDa was ffis6 tagged HELA2 (testisin) (Figure 5B). 

EXAMPLES 
IMMUNOLOGY 

25 (A) Rabbit Polyclonal Antibodies Directed Against HELA2 (testisin) Peptide Antigens 

Three peptides were selected from tiie HELA2 (testisin) amino acid sequence on the basis of 
predicted antigenicity, hydiophiUcity and lack of identity witii known proteins (Figure 6). 



30 Peptide antigen T20-33 KPESQEAAPLSGPC [SEQ ID N0:15] 

Peptide antigen T46-63 EDAELGRWPWQGSLRLWDC [SEQ ID NO: 16] 
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Peptide antigen T175-190 GYIKEDEALPSPHTLQC [SEQ ID NO:17] 

These peptides were synthesized (Auspep) and coupled to keyhole limpet hemocyanin. The 
coupled peptide (500 Fg) in PBS (0.5 ml) was emulsified in an equal volume of Freund's 

5 complete adjuvant before injection into a rabbit. Booster injections of coupled peptide in 
Freund's incomplete adjuvant were made at intervals of 2 to 3 weeks. Each rabbit was bled 
(approximately 1 ml) before the initial injection and about 7 days after the second and 
subsequent boosters and the antibody titre assessed by direct ELISA assay. Immunoreactive 
antisera against the peptide antigens was demonstrated and when a sufficienUy high titre was 

10 achieved (after 3 to 5 boosters), between 12 and 25 ml of bkxxi was removed from each animal. 

Rabbit antisera was affinity purified against the respective immunising peptides by 
chromatography using peptide-coupled affinity columns. Immunoreactivity of the affinity 
purified antibodies against HELA2 (testisin) was demonstrated by Western blot analysis of 

15 GST-tagged recombinant HELA2 (testisin). pGEX- 1(90-279) plasmid DNA (described in 
Example 4) was electio-transfonned into E. coll DH5( cells and induced for 3 hours in the 
presence of 0.5mM IPTG. CeUs were lysed in 1.5% sarcosyl, 2% Triton XlOO and then 
sonicated. After removal of the insoluble fraction by centrifiigation, the cell lysate was mixed 
with a 50% slurry of Glutathione Sepharose 4B, washed, and the purified GST-Testisin(90-279) 

20 was eluted by boiling with SDS-Sample buffer. Figure 7 shows an example of Western blot 
analysis of the eluate using anti-Peptide T175-190 antibody demonstrating a purified, 
immunoreactive band representative of GST-linked HELA2(testisin) of approximately 47 kDa. 

(B) Rabbit Polyclonal Antibodies Directed Against Purified Bacterially Expressed HELA2 
25 (testisin) 

An SDS-PAGE gel slice containing purified ffis6 tagged HELA2 (testisin) (as described in 
Example 4, part (b)) is to be confined with adjuvant and rabbits immunized as described above. 
Rabbit antisera are tested by Western blot analysis for immunoreactivity against purified 
30 recombinant HELA2 (testisin) and HELA2 (testisin) in cell extracts, as well as use in 
iminunohistochemical analyses. 
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EXAMPLE 6 

EXPRESSION OF HELA2 (TESTISIN) IN EUKARYOTIC CELLS 

(A) Generation of expression constructs 

5 

Eukaryotic expression constructs encoding testisin(s) and tcstisin(L) His6 tagged at the carboxy 
terminal were generated in the eukaryotic expression vector pcDNA3 (Invitiogen). DNA 
fragments encoding HELA2 (testisin) were generated by PCR from both pBluescriptHELA2(S) 
and pBluescriptHELA2(L) using the primers: 
10 forward: 5' GCACAGGTACCGAGGCCATGGGCGCGCGC 3' [SEQ ID NO: 18] and 
reverse 5' gCACATCTAGATCAGTGGTGGTGGTGGTGGTGGACCGGCCCCAGGA 

GTGG 3' [SEQ ID NO: 19] 

The PCR product of 985 bp obtained ftom amplification of HELA2 (testisin) from 
15 pBluescriptHELA2(S) as template was ligated into pGEM-T (Easy) vector (Promega). 
Digestion of this shuttle construct with Noa released a 1025 bp fragment which was Ugated into 
pcDNA3 generating the short isoform expression construct pcDNA3-Test(S-Q (Figure 8). 
PGR amplification of the long isoform template gave a 991 bp product which was Ugated into 
pGEM-T (easy) vector. NotI digestion of the shutUe construct released a 1031 bp fragment 
20 which was Ugated into pcDNA3 giving pcDNA3-Test(L-Q (Figure 8). 

Soluble testisin (l-295)-His6 in which the membrane anchoring sequence is deleted and the 
protein is carboxy-His6 tagged is to be obtained by PCR amplification of HELA2 (testisin) from 
pBluescriptHELA2(L) using the primers: 
25 forward: 5" GCACAGCGGCCGCGAGGCCATGGGCGCGCGC 3' [SEQ ID NO:20] and 
reverse: 5' GCACAGCGGCCGCTCAGTGGTGGTGGTGGTGGTGCCAGGAGGGGTC 
TGGCTG 3* [SEQ ID NO:21]. 

The PCR product wiU be digested with Nott and Ugated into pcDNA3 generating the long 
isoform expression construct pcDNA3-Test(l-295)L-C (Figure 8). 

30 

(B) Expression and ceUular localisation of HELA2 (testisin) 
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Each of the repression constructs is transiently transfected into a eukaryotic ceU line (eg. HeLa, 
CHO or COS cells) by electroporation. Expression is confirmed by Northern blot and 
immunoblot. The His6 tag is a small, uncharged tag which reportedly does not interfere with 
cellular membrane interactions and is able to be detected with anti-His6 antibodies. HELA2 

5 (testisin) cellular localisation is analysed by immunofluorescence using antibodies directed 
against the His6 tag andstainedcellsexaminedbyconfocalmicroscopy. Mock transfected ceUs 
is monitored as one of the controls in these experiments. Cells are examined under 
non-permeablised and permeabilised conditions to investigate intraceUular and ceD surface 
expressionof HELA2 (testisin) Ugged proteins. Possible release of HELA2 (testisin) into the 

10 supernatant is monitored by immunoblotting of cor^ditioried media. Association of HELA2 
(testisin) witii a particular cellular compartment is confirmed by ceUular fractionation studies. 
Stable transfectants of Uie M length and truncated Ugged HELA2 (testisin) is generated by 
selection in G418. Recombinant HELA2 (testisin) is purified from tiiese stable transfectants 
using ametalafiinity resin (eg. Qiagen or Clontech) for assay of its bioactivity andefficacy as 

15 a tiier^utic reagent 

EXAMPLE? 

HELA2 (TESTISIN) IS SPECIFICALLY EXPRESSED IN 
THE NORMAL TESTIS, AND IS ASSOCIATED WITH SPERM DEVELOPMENT 

20 

(A) Normal Tissue Blot 

Dot btot analysis of PolyA+ RNA irom 50 normal tissue specimens (strandardised to 8 different 
housekeeping genes) (Clontech) was performed using a 32P-labelled HELA2 (testisin) probe. 
25 HybridizationoftiieradiolabelledprobewasinExpressHybsolution(Clontech)at65o. The 

Wots were washed to a final stiingency of 0. lxSSCA).5% w/v SDS. High level expression of 
HELA2 (testisin) was fomid only in the testis as shown by the histogram plot of tiie Signal 
Intensity in Figure 9. In contrast, probing of the same blot witii BC0N3 showed ubiquitous 
expression of BC0N3 mRNA in a variety of tissues (Figure 9). 

30 

(B) Multiple Tissue Northern Blot 
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Northem blots displaying polyA+ mRNA from 16 different normal tissues (Clontech) were 
hybridised at 65«>C in ExpressHyb solution using a 400bp SacD/EcoRI 32P-labeUed HELA2 
probe for 3h and then washed to a final stringency of 0.1xSSCV0.1%SDS at 6(fC. After a 5h 
exposure, a strong band was observed only in the lane containing testis mRNA. demonstrating 
5 the specificity ofHELA2(testisin) expression for the testis (Figure lOA). Prolonged exposure 
(4.5 days) of the blot revealed a very low level of HELA2 (testisin) mRNA expression in the 
prostate, hmg and pancreas only. In contrast to HELA2. BC0N3 is expressed in mRNA from 
most tissues present on the blot (Figure lOB). 



10 

(C) HELA2 (testisin) is Expressed in Sperm Cells. Demonstrating its Gerai Cell Origin 

To determine whether HELA2 (testisin) expression is associated with germ ceUs of the testis, 
ejaculate specimens from normal fertile males were con^ with those of post-vasectomy 
15 males by RT-PCR analysis using HELA2 (testisin) specific primers. Sperm is tiie primary 
product ftom ti»e testis that is found in ejaculate; otiier components of the ejaculate are derived 
from the prostate. 

Fust strand cDN A was levei^ transcribed ftom total RNA which has been isolated from frozen 
20 or fiesh ejaculate specimens. PGR was performed on the cDNA templates using die primers: 
forward: 5' CTGAGTrCGATGGGATGCTT 3' [SEQ ID NO:22] and 
reverse: 5' GCTGAGGACTGCAATGTGAT 3" [SEQ ID NO:23]. 

As shown inFigure 11. strong signals of the expected size of 464 bp were detected in ejaculate 
fiomnonnal males (Patients #23 and #31). while no HELA2 (testisin) was detected in Patient 
25 #153 (post-vasectomy). Patent #90 (post-vasectomy) showed a low level of amplification 
productwhichmayreflectasmaUamountofresidualspermintheseminiferoustubules. PGR 

using primers specific for (2-macroglobulin was performed on die same samples as a control 
for the presence of approximately equal amounts of cDNA in each sample. 



30 



(D) HELA2 (testisin) is Expressed in Immature Germ Cells of the Testis 
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In situ hybridization was performed on paraffin-embedded specimens of rat testis tissue using 
DIG- labeUed HELA2 (testisin) RNA probes (T3 and T7 generated transcripts containing 
nucleotides 1-423 of HELA2 cDNA). The results using the antisense RNA probe showed 
strong positive staining near the basal lamina of the seminiferous tubules in the region 
5 associated with spermatocytes and spermatogonia (Figure 12, see arrows). HELA2 (testisin) 
mRNA expression did not appear to be associated with Leydig cells and the pattern was not 
typical for Sertoli ceU staining. The presence of HELA2 (testisin) mRNA in these cells indicates 
a role for HELA2 (testisin) in germ cell maturation and sperm development. 



10 

EXAMPLES 

HELA2 (TESTISIN) EXPRESSION IS ASSOCIATED WITH TUMOURS 
IN NON-TESTIS CELL-TYPES 



15 The tissue and cell-type distribution of testisin mRNA transcripts in tumours were determined 
by Northern hybridization analyses of RNA extracted fitom in vitro cultured tumour cells lines 
derived ftom different cancerous tissues. HELA2 (testisin) was detected in the HeLa ovarian 
carcinoma, the U937 lymphoma, and the melanoma cell line 253-3D. HELA2 (testisin) is also 
associated wiUi cDNA libraries derived fiom tumours of tiie colon, pancreas, prostate and ovary 

20 (NCBI-EST Database). The presraice of HELA2 (testisin) in tumours where it is not expressed 
normally indicates tiiat it Ukely plays a role in tumourigenesis in sevaral cell-types. 



EXAMPLE 9 
THE HELA2 (TESTISIN) GENE IS LOCATED 
25 ON HUMAN CHROMOSOME 16pl3.3 

The genetic location of testisin was mapped to tiie short arm of chromosome 16 at 16pl3.3 by 
fluorescence in-situ hybridization to normal metaphase chromosomes (Figure 13A). Screening 
of a chromosome 16 hybrid panel then sub-localised HELA2 (testisin) to the cosmid 406D6 
30 which has been mapped to tiiis region (Sood, R. et al (1997)Genomics 42: 83-95; Doggett. 
N.A. et al. (1995) Nature 377 (Suppl.):335-365. The cosmid ties between the markers 
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D16S246 and D16S468 and the gene is located just centromeric to D16S246 (Figure 13B). 
This region of the human genome is associated with high genetic instabiUty and telomeric 
reairangpments underlie a variety of common human genetic disorders. Testisin is sandwiched 
between the human disease genes PKDl (polycystic kidney disease) and tuberous sclerosis 
5 (TSC2) on the on side, and MEF (femilial mediterranean fever) and Rubenstein-Taybi syndrome 
(RSTS) on the other side as diagrammed in Figure 13B. 

EXAMPLE 10 

HELA2 (TESTISIN) mRNA AND PROTEIN EXPRESSION 
10 IS ABSENT IN TESTICULAR GERM CELL TUMOURS 

To determine whether HELA2 (testisin) may play a role in testicular tumourigenesis, HELA2 
(testisin) mRNA expression in normal testes and testicular tumour tissue obtained from 4 
patients diagnosed widi seminoma were compared by Northern blot analysis. HELA2 (testisin) 
15 mRNA was detected in normal testes from an four patients but was not detectable in tiie 
corresponding tumours (Figure 14A). This data indicates a tumour suppressor role for HELA2 
(testisin) in testicular germ cell tumours. 

Expression of HELA2 (testisin) protein in testicular tissue was examined by 
20 immunohistochemistry. Paraffin-embedded tissue sections were fixed, treated, blocked, 
incubated wiUi anti-peptide antibodies (1:10 dilution) and bound antibody detected witii tiie 
Vectastain Universal Elite ABC kit (Vector Laboratories). Negative controls were performed 
in tiie absence of antiTjody. Strong staining of HELA2 (testisin) was detected in the germ ceUs 
of normal testis (N) but was absent in the adjacent tumour tissue (T) (for example, see Figure 
25 14B). providing furth» evidwice of a tumour suppressor role for HELA2 (testisin) in testicular 
germ cell tumours. 

EXAMPLE 11 

GENOMIC ORGANISATION OF THE HELA2 (TESTISIN) GENE 

30 

The HELA2 (testism) gene is further characterised by determination of its genomic 
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organisation. Introihextron boundaries and most of the DNA sequence of the HELA2 (testisin) 
gene was detcnnined from cosmid DNA by DNA sequencing. A genomic map of HELA2 
(testisin) is given in Figure 15. The intion/exon boundaries are highly conserved relative to 
prostasin. although the sizes of the introns show considerable variation. The genomic DNA 
5 sequence with introns in tower case and exons in upper case is shown in Figure 16andinSEQ 
ID NO 25. DNA sequence analysis is being performed on RNA from tumour tissues to 
ascertain the predicted function of HELA2 (testisin) as a tumour suppressor. 

EXAMPLE 12 

10 THE HELA2 (TESTISIN) SHORT AND LONG ISOFORMS 

ARE GENERATED BY ALTERNATIVE mRNA SPLICING 

Two isoforms of HELA2 (testisin) were identified which differ by an insertion of 2 amino acids 
(Tyr-Ser) between tiie catalytic His and Asp residues. These constitute ttie long (L) and short 

15 (S) isoforms. At tiie DNA level tiieie is a corresponding insertion of 6 nucleotides which 
generates a Sfcl restriction enzyme site. PGR amplification from single strand cDN A generated 
from HeLa ceU total RNA followed by DNA sequence analysis of tiie amplified product 
demonstrated tiiat the two isoforms arc generated ttirough tiie use of two alternative mRNA 
splice sites. The DNA sequence for tiie intron and tiie flanking exons are shown in Figure 17. 

20 The resulting insertion of amino acids YS occurs 4 amino acids after tiie catalytic His residue 
of HELA2 (testisin). Preliminary molecular modeUing shows tiie presence of tiiis insertion is 
likely to alter tiie catalytic activity and/or specificity of HELA2 (testisin) for its substrates. 

EXAMPLE 13 

25 MUTATION ANALYS1S-HELA2 (TESTISIN) AS A TUMOUR SUPPRESSOR 

Intronic DNA sequence information generated above (see Example 1 1) is used to generate 
primers to amplify HELA2 (testisin) exons for SSCP analyses. Genomic DNA isolated from 
seminomas and corresponding normal testis as weU as genomic DNA from wUd-type and 
30 affected seminoma femily members arc analysed by SSCP for altered expression patterns 
indicative of genetic mutiitions. Evidence of genetic mutations are also being determined by 
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DNA sequence analysis. 

EXAMPLE 14 
HOMOLOGUES OF HUMAN HELA2 (TESTISIN) 
5 ARE PRESENT IN OTHER SPECIES 

Southern blot analysis of genomic DNA isolated from a range of species using a HELA2 
(testisin) cDNA probe shows that homologues of HELA2 (testisin) are present in hamster, 
^use. marmoset and monkey. The mouse homologue of HELA2 (testisin) was identified and 

10 obtained as an EST clone. The cDNA sequence and corresponding amino acid sequence of 
mouse HELA2 (testisin) was determined (Figure 18) and is given in SEQ ID NO 27. The 
mouse cDNA encodes a protein which contains the catalytic triad of His. Asp and Ser (cirx:les) 
and 10 cysteine residues (small boxes), and an activation site (triangle) as found in HELA2 
(testisin) The hydrophilicity plot shows the presence of a hydrophobic sequence at the carboxy 

15 terminussuggestingthepresenceofaputativemembraneanchor. comparison 

human sequences show 68. 1% homology at the cDNA level and 69. 1 % homology at the ammo 
add level. 

EXAMPLE 15 

20 HELA2 (TESTISIN) IS PART OF A CLUSTER OF 

HOMOLOGOUS GENES ON CHROMOSOME 16pl3 J 

Analvsis of DNA sequences released to NCBI databases reveals the piesence of homologues 
of HELA2 (testisin) in a cluster on Chromosome 16pl3.3. Figure 19 shows the positions of 

25 these genes, designated SPOOILA. SP002LA. SP003LA, and SP004LA, relative to HELA2 
(testisin) and the respective cosmids (Sood. R. et al ( 1997) Genomics 42: 83-95) in which they 
are located. Figure 20A. 20B and 20C show the partial cDNA and deduced amino acid 
sequences of SPOOILA. SP002LA, and SP003LA respectively. Each cDNA encodes a protein 
which contains the catalytic triad of His, Asp and Ser (circles) and 10 cysteine residues (small 

30 boxes), and an activation site (triangle) as found in HELA2 (testisin). Comparisons of the 
cDN A and amino acid sequences from the heavy chain region through to the poly A tail gives 
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the % identity with HELA2 (testisin) as foUows: 





cDNA 


Protein 


SPOOILA 


34.8% 


47.3% 


SP002LA 


41.0% 


47.1% 


SP003LA 


40.3% 


51.3% 



Each of the serine proteinases encoded by these genes show that they have carboxy tenninal 
extensions, and SP002LA is the only one with a hydrophobic carboxy tenninal tail indicative 

10 of a membrane anchored protein. Identification of an expressed sequence tag (EST) from a 
human testis cDNA Ubrary demonstrates that this gene is expressed in the testis, like HELA2 
(testisin). The location of this serine proteinase cluster on chromosome 16pl3.3 flanking 
HELA2 (testisin) suggests that these serine proteinases are also involved, like HELA2(testisin), 
in sperm maturation and development Thus ihey may constitute a proteolytic cascade which 

15 isessentialforthesepKX««es.l^ssormutationofthesege^ 
tumours and to other testicular abnormalities, such as infertility. 

EXAMPLE 16 
ATC2 SERINE PROTEINASE 

20 

ATC2 was isolated from the cDN A of PAI-2 expressing HeLa cells foUowing treatment with 
TNFandcycloheximide. A partial DNA sequence for ATC2 cDNA has been obtained which 
encompasses the sequence encoding the serine proteinase catalytic region. Additional clones 
extending to both 5' and 3" directions have been obtained. The available nucleic acid sequence 

25 of ATC2 cDNA and its deduced amino acid sequence shows that it is a member of the serine 
proteinase family with homology to hepsin. prostasin. and acrosin. It thus belongs to the same 
famUy as HELA2. The catalytic region includes the His. Asp and Ser conserved motifs. 
Preliminary Northern blot experiments have Med to detect ATC2 mRN A in total RN A isolated 
from resting HeLa cells, indicating it is not expressed in abundance in these cells, which may 

30 tiKMeforebetightlyregulated. As ATC2 was isolated ftom cells following treatment withTNF 
and cyclohexunide. its expression may be induced by these agents in HeU cells. These data 
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have potential significance for a role for ATC2 in apoptosis and cell death. ATC2 nmy be 
intracellular, extracellular or found on the cell surface and is Ukely to be involved in regulating 
cen functions. Thus ATC2 may have potential significance in the treatment of cancer and 
diseasesinvoWingdysiegulalionofceUgrowthandsurvival. The nucleotide and corresponding 
5 amino acid sequence of ATCl is shown in SEQ ID NOs: 7 and 8. respectively. 



EXAMPLE 16 
BC0N3 

10 

The deduced amino acid sequence of BC0N3 (SEQ ID NO: 10) reveals that it is novel. At both 
the DNA and protein level. BCON3 shows homology to members of the kinase famUy of 
proteins. Although it cannot be classified as a member of any particular sub-family of kinases, 
alignments of the BC»N3 protein with the conserved domains of thymidine kinases and tyrosine 
15 andserinemtteonineproteinkinasesindicatespossibbATP/GT^ 

regfons. Thus. U may be the first member of a new femily of kinases. Analysis of the translation 
product using hydrophobfcity ptots and the Prosite protein analysis algorithms indicates BC0N3 
may lack an N-terminal signal sequence (that is. it is likely to encode an intracellular protein) 
and it possesses a nuclear localization signal. BC0N3 mRNA is approximately 2300 
20 nucleotides in length cDNA sequence (SEQ ID NO:9) has been obtained covering about 95% 
of the transcript and including the 3' poly A tail. . BCON3 mRNA is expressed in most normal 
tissues as demonstrated by dot blot analysis of 50 normal tissue specimens (standardised to 8 
diferent housekeeping genes) (Clontech). (Figure 9). Analysis of BCON3 mRNA expression 
using a muttiple tissue Northern blot displaying polyA+ mRNA from 1 6 different normal tissues 
25 (Qontech) shows that BCON3 is expressed in most tissues (Figure lOB). Expression by in 
vitro transcriptionAranslation expression using a partial BCON3 cDNA fragment shows 
BC0N3 encodes a protein. Two major transcription/translation products are detected, one of 
5 IkDa. the size piedkAed from the open reading frame, and a second product of about 43kDa. 
which may represent a partial translation product (Figure 21). 

30 

Those skilled in tije art will appreciate that the invention described herein is susceptible to 
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variations and modifications other than those specifically described. It is to be understood that 
the invention includes all such variations and modifications. The invention also includes all of 
the steps, features, compositions and compounds referred to or indicated in this specification, 
individuaUy or collectively, and any and all combinations of any two or more of said steps or 
5 features. 
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SEQUENCE USTING 



(1) GENERAL INFORMATION: 

^'^^siriyjr ANTALISToni Marie and HOOPER John David 
(Other than US): AMRAD OPERATIONS PTY LTD 

(ii) TITLE OF INVENTION: NOVEL MOLECULES 

(iii) NUMBER OF SEQUENCES: 24 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 
(P) STATE: VICrORL\ 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Hoppy disk 

(B) COMPUTER: IBM PC compatiTile 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Intiemational PCT Application 

(B) FILING DATE: 13-FEB-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: P05 101/97 

(B) FILING DATE: 13-FEB-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPUCATION NUMBER: PP0422/97 

(B) FILING DATE: 18-NOV-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: HUGHES, DR E JOHN L 

(C) REFERENCE/DOCKET NUMBER: EJH/AF 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 9254 2777 

(B) TELEFAX: +61 3 9254 2770 

(C) TELEX:AA 31787 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ACAGAATTCT GGGTIGTIAC IGCIGCICAY TG 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

( c ) STRANDEDNESS : s ing le 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ACAGAATTCA XIGGICCICC IC/GT/AXTCICC 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 965 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CGCGGGAGAG GAGGCC ATC GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 49 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
1 5 10 

CTG CTG GCT CGG GCT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 91 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 

CCG TTA TCA GGA CCA TCC GGC CGA CGG 6TC ATC ACG TCG CGC ATC GTG 145 
Pro Leu Ser Gly Pro Cys Gly Arg Arg Val lie Thr Ser Arg lie Val 
30 35 40 

GGT GGA GAG GAC GCC GAA CTC GGG CGT TGG CCG TGG CAG GGG AGC CTG 193 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 

CX3C CTG TGG GAT TCC CAC GTA TGC GGA GTG AGC CTG CTC AGC CAC CGC 241 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 70 75 

TGG GCA CTC ACG GCG GCG CAC TGC TTT GAA ACT GAC CTT AGT GAT CCC 289 
Trp Ala Leu Thr Ala Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro 
80 85 90 

TCC GGG TGG ATG GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA TCC TTC 337 
ser Gly Trp Met Val Gin Phe Gly Gin Leu Thr ser Met Pro Ser Phe 
95 100 105 

TGG AGC CTG CAG GCC TAC TAC ACC CGT TAG TTC GTA TCG AAT ATC TAT 385 
Trp ser Leu Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn lie Tyr 
110 115 120 

CTG AGC CCT CGC TAC CTG GGG AAT TCA CCC TAT GAC ATT GCC TTG GTG 433 
Leu Ser Pro Arg Tyr Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val 
125 130 135 

AAG CTG TCT GCA CCT GTC ACC TAC ACT AAA CAC ATC CAG CCC ATC TGT 481 
Lys Leu Ser Ala Pro Val Thr Tyr Thr Lys His He Gin Pro He Cys 
140 145 150 155 

CTC CAG GCC TCC ACA TTT GAG TTT GAG AAC CGG ACA GAC TGC TGG GTG 529 
Leu Gin Ala Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val 
160 165 170 
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721 



769 



ACT GGC TGG GGG TAC ATC AAA GAG GAT GAG GCA CTG CCA TCT CCC CAC 577 
Thr Gly Trp Gly Tyr He Lys Glu Asp Glu Ala Leu Pro Ser Pro His 
175 180 185 

ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT AIG TGC AAC 625 
Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys Asn 
190 195 200 

CAC CTC TTC CTC AAG TAC AGT TTC CGC AAG GAC ATC TTT GGA GAC ATG 673 
His Leu Phe Leu Lys Tyr Ser Phe Arg Lys Asp He Phe Gly Asp Met 
205 210 215 

GTT TCT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC GGT GAC 
Val cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp 
220 225 230 235 

TCA GGT GGA CCC TTG GCC TCT AAC AAG GAT GGA CTG TGG TAT CAG ATT 
Ser Gly Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr Gin He 
240 245 250 

GGA GTC GTG AGC TCG GGA GTC GGC TCT GGT CGG CCC AAT CGG CCC GGT 817 
Gly Val Val Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg Pro Gly 
255 260 265 

GTC TAC ACC AAT ATC AGC CAC CAC TTT GAG TCG ATC CAG AAG CTC ATC 865 
val Tyr Thr Asn He Ser His His Phe Glu Trp He Gin Lys Leu Met 
270 275 280 

GCC CAG AGT GGC ATC TCC CAG CCA GAC CCC TCC TCG CCG CTA CTC TTT 913 
Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe 
285 290 295 

TTC CCT CTT CTC TGG GCT CTC CCA CTC CTG GGG CCG GTC TCA 961 
Phe Pro Leu Leu Trp Ala Leu Pro Leu Leu Oly Pro Val * 
300 305 310 

GCCTACCTCA GCCCATCCAG CCTCGGGCCA CTGCCAAGTC AGGCCCTGGT TCTCTTCTCT 1015 
AATAAACACA TTCCAGTTGA TGCCTTGCA6 GGCATTTTTC AAAAAAAAAA 1075 



AAAAAAAAAA AAAAAAAAA 



1094 



(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 
15 10 15 

Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
20 25 30 

Cys Gly Arg Arg Val He Thr Ser Arg He Val Gly Gly Glu Asp Ala 
35 40 45 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 55 60 

His Val Cys Gly Val Ser Leu Leu Ser His Arg Trp Ala Leu Thr Ala 
65 70 75 80 

Ala His Cys Phe Glu Thr Asp Leu Ser Asp Pro Ser Gly Trp Met Val 
85 90 95 

Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe Trp Ser Leu Gin Ala 
100 105 110 

Tyr Tyr Thr Arg Tyr Phe Val Ser Asn lie Tyr Leu Ser Pro Arg Tyr 
115 120 125 

Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val Lys Leu Ser Ala Pro 
130 135 140 

Val Thr Tyr Thr Lys His He Gin Pro He Cys Leu Gin Ala Ser Thr 
145 150 155 160 

Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val Thr Gly Trp Gly Tyr 
165 170 175 

He Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin Glu Val 
180 185 190 
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Gin Val Ala lie He Asn Asn Ser Met Cys Asn His Leu Phe Leu Lys 
195 



200 205 



Tyr ser Phe Arg Lys Asp He Phe Gly Asp Met Val Cys Ala Gly Asn 
210 215 220 

Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp Ser Gly Gly Pro Leu 
225 230 235 240 

Ala cys Asn Lys Asp Gly Leu Trp Tyr Gin He Gly Val Val Ser Trp 
245 250 255 

Gly val Gly Cys Gly Arg Pro Asn Arg Pro Gly Val Tyr Thr Asn He 
260 265 270 

Ser His His Phe Glu Trp He Gin Lys Leu Met Ala Gin Ser Gly Met 
275 280 285 

Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe Phe Pro Leu Leu Trp 
290 295 300 

Ala Leu Pro Leu Leu Gly Pro Val * 
305 310 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEiJGTH: 1100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 961 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CGCGGGAGAG GAGGCC ATG GGC GCG CGC GGG GCG CTG CTG CTG GCG CTG 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu 
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CTG CTG GCT CGG GCT GGA CTC AGG AAG CCG GAG TCG CAG GAG GCG GCG 
Leu Leu Ala Arg Ala Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala 
15 20 25 

CCG TTA TCA GGA CCA TGC GGC CGA CGG GTC ATC ACG TCG CGC ATC GTG 
pro Leu Ser Gly Pro Cys Gly Arg Arg Val He Thr Ser Arg lie Val 
30 35 40 

GGT GGA GAG GAC GCC GAA CTC GGG CGT TGG CCG TGG CAG GGG AGC CTG 
Gly Gly Glu Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu 
45 50 55 

CGC CTG TGG GAT TCC CAC GTA TCC GGA GTG AGC CTG CTC AGC CAC CGC 
Arg Leu Trp Asp Ser His Val Cys Gly Val Ser Leu Leu Ser His Arg 
60 65 ^0 75 

TGG GCA CTC ACG GCG GCG CAC TGC TTT GAA ACC TAT AGT GAC CTT ACT 
Trp Ala Leu Thr Ala Ala His Cys Phe Glu Thr Tyr Ser Asp Leu Ser 
80 85 90 

GAT CCC TCC GGG TGG AT6 GTC CAG TTT GGC CAG CTG ACT TCC ATG CCA 
Asp Pro Ser Gly Trp Met Val Gin Phe Gly Gin Leu Thr Ser Met Pro 
95 100 105 

TCC TTC TGG AGC CTG CAG GCC TAC TAC ACC CGT TAC TTC GTA TCG AAT 
Ser Phe Trp Ser Leu Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn 
lie 115 120 

ATC TAT CTG AGC CCT CGC TAC CTG GGG AAT TCA CCC TAT GAC ATT GCC 
He Tyr Leu Ser Pro Arg Tyr Leu Gly Asn Ser Pro Tyr Asp He Ala 
125 130 135 

TTG GTG AAG CTG TCT GCA CCT GTC ACC TAC ACT AAA CAC ATC CAG CCC 
Leu Val Lys Leu Ser Ala Pro Val Thr Tyr Thr Lys His He Gin Pro 
140 145 150 155 

ATC TGT CTC CAG GCC TCC ACA TTT GAG TTT GAG AAC CGG ACA GAC TGC 
He cys Leu Gin Ala Ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys 
160 165 170 



97 



145 



193 



241 



289 



337 



385 



433 



481 



529 



TGG GTG ACT GGC TGG GGG TAC ATC AAA GAG GAT GAG GCA CTG CCA TCT 
Trp val Thr Gly Trp Gly Tyr He Lys Glu Asp Glu Ala Leu Pro Ser 
175 180 185 



577 
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CCC CAC ACC CTC CAG GAA GTT CAG GTC GCC ATC ATA AAC AAC TCT ATG 
Pro His Thr Leu Gin Glu val Gin Val Ala He He Asn Asn Ser Met 
190 195 200 

TGC AAC CAC CTC TTC CTC AAG TAC AGT TTC CGC AAG GAC ATC TTT GGA 
Cys Asn His Leu Phe Leu Lys Tyr Ser Phe Arg Lys Asp He Phe Gly 
205 210 215 

GAC ATG GTT TGT GCT GGC AAT GCC CAA GGC GGG AAG GAT GCC TGC TTC 
ASP Met val cys Ala Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe 
220 225 230 235 

GGT GAC TCA GGT GGA CCC TTG GCC TGT AAC AAG GAT GGA CTG TGG TAT 
Gly ASP ser Gly Gly Pro Leu Ala Cys Asn Lys Asp Gly Leu Trp Tyr 
240 245 250 

CAG ATT GGA GTC GTG AGC TGG GGA GTG GGC TGT GGT CGG CCC AAT CGG 
Gin He Gly Val Val Ser Trp Gly Val Gly Cys Gly Arg Pro Asn Arg 
255 260 265 

CCC GGT GTC TAC ACC AAT ATC AGC CAC CAC TTT GAG TGG ATC CAG AAG 
Pro Gly val Tyr Thr Asn He Ser His His Phe Glu Trp He Gin Lys 
270 275 280 

CTG ATC GCC CAG AGT GGC ATC TCC CAG CCA GAC CCC TCC TGG CCG CTA 
Leu Met Ala Gin Ser Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu 
285 290 295 

CTC TTT TTC CCT CTT CTC TGG GCT CTC CCA CTC CTC GGG CCG GTC TGAGCCTACC 
968 

Leu Phe Phe Pro Leu Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 
300 305 310 315 

TGAGCCCATC CAGCCTGGGG CCACTGCCAA GTCAGGCCCT GGTTCTCTTC TCTCTT6TTT 1028 
GGTAATAAAC ACATTCCAGT TGATGCCTTG CAGGGCATTT TTCAAAAAAA AAAAAAAAAA 1088 

1100 

AAAAAAAAAA AA 



673 



721 



769 



817 



865 



913 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
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(D) TOP0I/X3V: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Met Gly Ala Arg Gly Ala Leu Leu Leu Ala Leu Leu Leu Ala Arg Ala 



10 



15 



Gly Leu Arg Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro 
20 25 30 

Cys Gly Arg Arg Val He Thr Ser Arg He Val Gly Gly Glu Asp Ala 
35 40 *5 

Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp Ser 
50 55 60 

His val cys Gly Val Ser Leu Leu Ser His Arg Trp Ala Leu Thr Ala 
65 75 80 

Ala His cys Phe Glu Thr Tyr Ser Asp Leu Ser Asp Pro Ser Gly Trp 



85 



90 95 



Met val Gin Phe Gly Gin Leu Thr Ser Met Pro Ser Phe Trp Ser Leu 



100 



105 110 



Gin Ala Tyr Tyr Thr Arg Tyr Phe Val Ser Asn He Tyr Leu Ser Pro 
115 



120 125 



Leu Gly Asn Ser Pro Tyr Asp He Ala Leu Val Lys Leu Ser 
135 140 

val Thr Tyr Thr Lys His He Gin Pro He Cys Leu Gin Ala 
150 155 ISO 

ser Thr Phe Glu Phe Glu Asn Arg Thr Asp Cys Trp Val Thr Gly Trp 



Arg Tyr 
130 

Ala Pro 
145 



165 



170 175 



Gly Tyr He Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin 
180 185 190 

Glu val Gin Val Ala He He Asn Asn Ser Met Cys Asn His Leu Phe 
195 200 205 
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Leu Lys Tyr 
210 



ser Phe Arg Lys Asp lie Phe Gly Asp Met Val Cys Ala 
215 220 



Gly Asn Ala Gin Gly Gly Lys Asp Ala Cys Phe Gly Asp Ser Gly Gly 
225 230 235 240 



Pro Leu 



Ser Trp 



Ala cys Asn Lys Asp Gly Leu Trp Tyr Gin He Gly Val val 



245 



250 



255 



Gly val Gly Cys Gly Arg Pro Asn Arg Pro Gly Val Tyr Thr 



260 



265 



270 



Asn 



He Ser His His Phe Glu Trp He Gin Lys Leu Met Ala Gin Ser 



275 



280 



285 



Gly Met Ser Gin Pro Asp Pro Ser Trp Pro Leu Leu Phe Phe Pro Leu 
290 295 300 



Leu Trp Ala Leu Pro Leu Leu Gly Pro Val 
305 310 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 24., 799 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGTTCAGATG AATGGGACTG TGA GAA CCA TCT GTG ACC AAA TTG ATA CAG 50 

Glu Pro Ser Val Thr Lys Leu He Gin 
1 5 



GAA CAG GAG AAA GAG CCG CGG TGG CTG ACA TTA CAC TCC AAC TGG GAG 



98 
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Glu Gin Glu Lys Glu Pro Arg Trp Leu Thr Leu His Ser Asn Trp Glu 



•ic 20 
10 " 



25 



A6C CTC AAT GGG ACC ACT TTA CAT GAA CTT GTA GTA AAT GGG CAG TCT 
ser Leu Asn Gly Thr Thr Leu His Glu Leu Val Val Asn Gly Gin Ser 
30 35 40 

TCT GAG AGC AGA AGT AAA ATT TCT CTT CTG TGT ACT AAA CAA GAC TGT 
Cys Glu ser Arg Ser Lys He Ser Leu Leu Cys Thr Lys Gin Asp Cys 
45 50 55 

GGG CGC CGC CCT GCT GCC CGA ATC AAC AAA AGG ATC CTT GGA GGT CGG 
Gly Arg Arg Pro Ala Ala Arg Met Asn Lys Arg He Leu Gly Gly Arg 
60 65 70 

ACG AGT CGC CCT GGA AGG TGG CCA TGG CAG TGT TCT CTG CAG AGT GAA 
Thr ser Arg Pro Gly Arg Trp Pro Trp Gin Cys Ser Leu Gin Ser Glu 
75 80 85 

CCC AGT GGA CAT ATC TCT GGC TGT GTC CTC ATT GCC AAG AAG TGG GTT 
Pro ser Gly His He Cys Gly Cys Val Leu He Ala Lys Lys Trp Val 
90 95 100 105 

GTC ACA GTT GCC CAC TGC TTC GAG GGG AGA GAG AAT GCT GCA GTT TGG 
val Thr val Ala His Cys Phe Glu Gly Arg Glu Asn Ala Ala Val Trp 
110 



115 120 



AAA GTC GTC CTT GGC ATC AAC AAT CTA GAC CAT CCA TCA GTC TTC ATC 
Lys val val Leu Gly He Asn Asn Leu Asp His Pro Ser Val Phe Met 
125 



130 135 



CAG ACA CGC TTT GTG AGG ACC ATC ATC CTC CAT CCC CGC TAC AGT CGA 
Gin Thr Arg Phe Val Arg Thr He He Leu His Pro Arg Tyr Ser Arg 
140 145 150 

GCA GTC GTC GAC TAT GAC ATC AGC ATC GTT GAG CTC AGT GAA GAC ATC 
Ala val val Asp Tyr Asp He Ser He Val Glu Leu Ser Glu Asp He 
155 160 165 



AGT GAG ACT GGC TAC GTC CGG CCT GTC TCC TTC CCC AAC CCG GAG CAG 
ser Glu Thr Gly Tyr Val Arg Pro Val Cys Leu Pro Asn Pro Glu Gin 
170 



175 180 185 



TGG CTA GAG CCT GAC ACG TAC TCC TAT ATC ACA GGC TGG GGC CAC ATC 
Trp Leu Glu Pro Asp Thr Tyr Cys Tyr He Thr Gly Trp Gly His Met 



146 



194 



242 



290 



338 



386 



434 



482 



530 



578 



626 
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IQc 200 

190 



GGC AAT AAA ATG CCA TTT AAG CTG CAA GAG GGA GAG <^ COC ATT ATT 
Gly Asn Lys Met Pro Phe Lys Leu Gin Glu Gly Glu Val Arg He He 
205 



210 215 



TCT CTG GAA CAT 1«T CAG TCC TAC TTT GAC ATG AAG ACC ATC ACC ACT 
Ser Leu Glu His Cys Gin Ser Tyr Phe Asp Met Lys Thr He Thr Thr 

09c: 230 
220 22b 

CGG ATG ATA TGT GCT GGC TAT GAG TCT GGC ACA GTT GAT TCA TGC ATG 
Arg Met He Cys Ala Gly Tyr Glu Ser Gly Thr Val Asp Ser Cys Met 
235 240 245 

GGT GAC TGG GGC GGT CCG TTG AAT TCT GT 
Gly Asp Trp Gly Gly Pro Leu Asn Ser 
250 255 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO. 8: 

Glu pro ser Val Thr Lys Leu He Gin Glu Gin Glu Lys Glu Pro Arg 
1 5 10 

Trp Leu Thr Leu His Ser Asn Trp Glu Ser Leu Asn Gly Thr Thr Leu 



20 



25 30 



His Glu Leu val Val Asn Gly Gin Ser Cys Glu Ser Arg Ser Lys lie 
35 40 *^ 

ser Leu Leu Cys Thr Lys Gin Asp Cys Gly Arg Arg Pro Ala Ala Arg 



55 60 



SO 

Met Asn Lys Arg lie Leu Gly Gly Arg Thr Ser Arg Pro Gly Arg Trp 
65 



674 



722 



770 



799 



70 75 80 
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Pro Trp Gin Cys Ser Leu Gin Ser Glu Pro Ser Gly His He Cys Gly 



85 



90 95 



Cys Val Leu 

100 



He Ala Lys Lys Trp Val Val Thr Val Ala His Cys Phe 



105 



Glu Gly Arg 
115 



Glu Asn Ala Ala Val Trp Lys Val Val Leu Qly He Asn 



120 125 



Asn Leu Asp His Pro Ser Val Phe Met Gin Thr Arg Phe Val Arg Thr 



130 



135 140 



He He Leu His Pro Arg Tyr Ser Arg Ala Val Val Asp Tyr Asp He 
145 150 155 

ser He Val Glu Leu Ser Glu Asp He Ser Glu Thr Gly Tyr Val Arg 
165 170 175 

pro val cys Leu Pro Asn Pro Glu Gin Trp Leu Glu Pro Asp Thr Tyr 



180 



185 



190 



cys Tyr He Thr Gly Trp Gly His Met Gly Asn Lys Met Pro Phe Lys 
195 200 205 



Leu 

210 



Gin Glu Gly Glu val Arg He He Ser Leu Glu His Cys Gin Ser 



215 220 



Tyr Phe Asp Met Lys Thr He Thr Thr Arg Met He Cys Ala Gly Tyr 
225 230 235 240 



Glu Ser Gly Thr Val Asp Ser Cys Met Gly Asp Trp Gly Gly Pro Leu 
245 



250 255 



Asn Ser 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 166.. 1773 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATTTAATACG ACTCACTATA GGGAATTTGG CCCTCGAGOA AGAATTCGGC ACGAGGCTGC 

GGCGCACTCT GAGGGAGTCG CTGTCATCCG GGGCCCCGAA CCCGACTGGA GCTGAAGCGC 

AGGCTGCGGG GCGCGGAGTC GGGAGGCCTG AGTGTTCCTT CCAGC ATG TOG GAG 

Met Ser Glu 



GGG GAG TCC GAG ACA GTA CTT AGO AGT GGC TCA GAG CCA AAG GTA GAA 
Gly Glu ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro Lys Val Glu 
5 10 " 

TCT TCA TCT TCA GCT CCT GGC CTC ACA TCA GTG TCA CCT CCT GTG ACC 
ser ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro Pro Val Thr 
20 



25 30 



TCC ACA ACC TCA GCT GCT TCC CCA GAG GAA GAA GAA GAA AGT GAA GAT 
ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu Ser Glu Asp 
40 45 50 

GAG TCT GAP ATT TTG GAA GAG TCG CCC TGT GGG CGC TGG CAG AAG AGG 
Glu ser Glu lie Leu Glu Glu Ser Pro Cys Gly Arg Trp Gin Lys Arg 
55 60 65 

CGA GAA GAG GTG AAT CAA CGG AAT GTA CCA GGT ATT GAC AGT GCA TAC 
Arg Glu Glu val Asn Gin Arg Asn Val Pro Gly He Asp Ser Ala Tyr 



70 



75 80 



CTG GCC ATG GAT ACA GAG GAA GGT GTA GAG GTT GTG TGG AAT GAG GTA 
Leu Ala Met Asp Thr Glu Glu Gly Val Glu Val Val Trp Asn Glu Val 
85 90 95 

CAG TTC TCT GAA CGC AAG AAC TAC AAG CTG CAG GAG GAA AAG GTT TGT 
Oln Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu Lys Val Cys 
100 105 110 1" 



60 
120 
174 

222 

270 

318 

366 

414 

462 

510 



GCT 



GTG TTT GAT AAT TTG ATT CAA TTG GAG CAT CTT AAC ATT GTT AAG 558 
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Ala Val Phe Asp Asn Leu He Gin Leu Glu His Leu Asn He Val Lys 
120 125 130 

rrr CAC AAA tat TGG GCT gag ATT AAA gag AAC AAG GCC AGG GTC ATT 
Phe His Lys Tyr Trp Ala Asp He Lys Glu Asn Lys Ala Arg Val He 
135 140 1*5 

TTT ATC ACA GGA TAG ATG TCA TCT GGG AGT CTG AAG CAA TTT CTG AAG 
Phe lie Thr Gly Tyr Met Ser Ser Gly Ser Leu Lys Gin Phe Leu Lys 
150 155 1«0 

AAG ACC CAA AAG AAC CAC CAG ACG ATG AAT GAA AAG GCA TGG AAG CGT 
Lys Thr Gin Lys Asn His Gin Thr Met Asn Glu Lys Ala Trp Lys Arg 
165 "0 

TGG TGC ACA CAA ATC CTC TCT GCC CTA AGC TAC CTG CAC TCC TGT GAC 
Trp Cys Thr Gin He Leu Ser Ala Leu Ser Tyr Leu His Ser Cys Asp 
180 



185 190 195 



CCC CCC ATC ATC CAT GGG AAC CTG ACC TGT GAC ACC ATC TTC ATC CAG 
pro pro lie He His Gly Asn Leu Thr Cys Asp Thr He Phe He Gin 
200 205 210 

CAC AAC GGA CTC ATC AAG ATT GGC TCT GTG GCT CCT GAC ACT ATC AAC 
His Asn Gly Leu He Lys He Gly Ser Val Ala Pro Asp Thr He Asn 
215 220 225 

AAT CAT GTG AAG ACT TGT CGA GAA GAG CAG AAG AAT CTA CAC TTC TTT 
Asn H^s val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu His Phe Phe 
230 235 240 

GCA CCA GAG TAT GGA GAA GTC ACT AAT GTG ACA ACA GCA GTG GAC ATC 
Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala Val Asp He 
245 250 255 

TAC TCC TTT GGC ATC TGT GCA CTG GGG ATG GCA GTG CTG GAG ATT CAG 
Tyr ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu Glu He Gin 
260 265 270 275 

GGC AAT GGA GAG TCC TCA TAT GTG CCA CAG GAA GCC ATC AGC AGT GCC 
Gly Asn Gly Glu Ser Ser Tyr Val Pro Gin Glu Ala He Ser Ser Ala 
280 285 290 

ATC CAG CTT CTA GAA GAC CCA TTA CAG AGG GAG TTC ATT CAA AAG TGC 
He Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe He Gin Lys Cys 



606 



654 



702 



750 



798 



846 



894 



942 



990 



1038 



1086 
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295 



300 



305 



CAG TCT GAG CCT GCT CGC AGA CCA ACA GCC AGA GAA CTT CTG 
I,eu Gin Ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu Leu Leu Phe 
310 



315 320 



CAC CCA GCA TTG TTT GAA GTC CCC TCG CTC AAA CTC CTT GCG GCC CAC 
His pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu Ala Ala His 
325 



330 335 



TGC ATT GTG GGA CAC CAA CAC ATG ATC CCA GAG AAC GCT CTA GAG GAG 
Cys lie val Gly His Gin His Met He Pro Glu Asn Ala Leu Glu Glu 

Tsn 355 

340 



345 350 



ATC ACC AAA AAC ATC GAT ACT AGT GCC GTA CTG GCT GAA ATC CCT GCA 
lie Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu He Pro Ala 
360 365 370 

GGA CCA GGA AGA GAA CCA GTT CAG ACT TTG TAC TCT CAG TCA CCA GCT 
Gly Pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin Ser Pro Ala 
375 380 385 

CTG GAA TTA GAT AAA TTC CTT GAA GAT GTC AGG AAT GGG ATC TAT CCT 
Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly He Tyr Pro 
390 395 400 

CTG ACA GCC TTT GGG CTC CCT CGG CCC CAG CAG CCA CAG CAG GAG GAG 
Leu Thr Ala Phe Gly Leu Pro ATg Pro Gin Gin Pro Gin Gin Glu Glu 



405 



410 



GT« ACA TCA CCT GTC GTG CCC CCC TCT G^ AAG ACT CCG ACA CCT GAA 
val Thr ser Pro Val Val Pro Pro Ser Val Lys Thr Pro Thr Pro Glu 



420 



425 



430 



435 



CCA GCT GAG GTG GAG ACT CGC AAG GTC GTC CTC ATC CAG TCC AAC ATT 
Pro Ala Glu val Glu Thr Arg Lys Val Val Leu Met Gin Cys Asn He 
440 



445 450 



GAG TC6 GTC GAG GAG GGA GTC AAA CAC CAC CTC ACA CTT CTC CTC AAG 
Glu ser val Glu Glu Gly Val Lys His His Leu Thr Leu Leu Leu Lys 
455 



460 465 



TTC GAG GAC AAA CTC AAC CGG CAC CTC AGC TCT GAC CTC ATC CCA AAT 
Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu Met Pro Asn 
47b 475 480 



1134 



1182 



1230 



1278 



1326 



1374 



1422 



1470 



1518 



1566 



1614 
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GAG AAT ATC CCC GAG TTG GCG GCT GAG CTG GTG CAG CTG GGC TTC ATT 
Glu Asn lie Pro Glu Leu Ala Ala Glu Leu Val Gin Leu Gly Phe He 



485 



490 495 



AGT GAG GCT GAC CAG AGC CGG TTG ACT TCT CTG CTA GAA GAG ACC TTG 
Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu Glu Thr Leu 
500 505 510 515 

AAC AAG TTC AAT TTT GCC AGG AAC AGT ACC CTC AAC TCA GCC GCT GTC 
Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser Ala Ala Val 



520 



525 530 



ACC GTC TCC TCT TAGAGCTCAC TCGGGCCAGG CCCTGATCTG CGCTGTGGCT 

Thr Val Ser Ser 
535 

GTCCCTGGAC GTGCTGCAGC CCTCCTGTCC CTTCCCCCCA GTCAGTATTA CCCTGTGAAG 
CCCCTTCCCT CCTTTATTAT TCAGGAGGGC TGGQGGGGCT CCCTGGTTCT GAGCATCATC 
CTTTCCCCTC CCCTCTCTTC CTCCCCTCTG CACTTTGTTT ACTTGTTTTG CACAGACGTG 
GGCCTGGGCC TTCTCAGCAG CCGCCTTCTA GTTGGGGGCT AGTCGCTGAT CTGCCGGCTC 
CCGCCCAGCC TGTGTGGAAA GGAGGCCCAC GGGCACTAGG GGAGCCGAAT TCTACAATCC 
CGCTGGGGCG GCCGGGGCGG GAGAGAAAGG O^TGCTGCA GTGGTGGCCC TGGGGGGCCA 
TTCGATTCGC CTCAGTTGCT GCTGTAATAA AAGTCTACTT TTTGCTAAAA AAAAAAAAAA. 
AAAAAAAAAA A 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met ser Glu Gly Glu Ser Gin Thr Val Leu Ser Ser Gly Ser Asp Pro 



1662 

1710 

1758 

1810 

1870 

1930 

1990 

2050 

2110 

2170 

2230 

2241 
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15 " ^5 

LYS Val Glu Ser Ser Ser Ser Ala Pro Gly Leu Thr Ser Val Ser Pro 
20 25 30 



pro val Thr Ser Thr Thr Ser Ala Ala Ser Pro Glu Glu Glu Glu Glu 
35 



40 45 



ser Glu Asp Glu Ser Glu He Leu Glu Glu Ser Pro Cys Gly Arg Trp 
50 55 60 

Gin Lys Arg Arg Glu Glu Val Asn Gin Arg Asn Val Pro Gly He Asp 



65 



70 75 80 



ser Ala Tyr Leu Ala Met Asp Thr Glu Glu Gly Val Glu Val Val Trp 
85 90 

Asn Glu val Gin Phe Ser Glu Arg Lys Asn Tyr Lys Leu Gin Glu Glu 
100 105 110 

Lys val Cys Ala Val Phe Asp Asn Leu He Gin Leu Glu His Leu Asn 



115 



120 



125 



He val Lys Phe His Lys Tyr Trp Ala Asp He Lys Glu Asn Lys Ala 
130 



135 140 



Arg 
145 



val He Phe He Thr Gly Tyr Met Ser Ser Gly Ser Leu Lys Gin 
150 155 160 



Phe Leu Lys Lys Thr Gin Lys Asn His Gin Thr Met Asn Glu Lys Ala 
165 



170 175 



Trp Lys Arg Trp Cys Thr Gin He Leu Ser Ala Leu Ser Tyr Leu His 
180 185 190 

ser cys Asp Pro Pro He He His Gly Asn Leu Thr Cys Asp Thr He 
195 200 205 

Phe He Gin His Asn Gly Leu He Lys He Gly Ser Val Ala Pro Asp 
210 215 220 



Thr He Asn Asn His Val Lys Thr Cys Arg Glu Glu Gin Lys Asn Leu 
225 230 235 240 

His Phe Phe Ala Pro Glu Tyr Gly Glu Val Thr Asn Val Thr Thr Ala 
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245 



250 255 



Val Asp lie Tyr Ser Phe Gly Met Cys Ala Leu Gly Met Ala Val Leu 
260 265 270 

Glu lie Gin Gly Asn Gly Glu Ser Ser Tyr Val Pro Gin Glu Ala He 
275 280 285 

Ser ser Ala He Gin Leu Leu Glu Asp Pro Leu Gin Arg Glu Phe He 
290 295 300 

Gin Lys cys Leu Gin Ser Glu Pro Ala Arg Arg Pro Thr Ala Arg Glu 
305 310 315 320 

Leu Leu Phe His Pro Ala Leu Phe Glu Val Pro Ser Leu Lys Leu Leu 
325 330 335 

Ala Ala His Cys He Val Gly His Gin His Met He Pro Glu Asn Ala 
340 345 350 

Leu Glu Glu He Thr Lys Asn Met Asp Thr Ser Ala Val Leu Ala Glu 
355 360 365 

He Pro Ala Gly Pro Gly Arg Glu Pro Val Gin Thr Leu Tyr Ser Gin 
370 375 380 



Ser Pro Ala Leu Glu Leu Asp Lys Phe Leu Glu Asp Val Arg Asn Gly 
385 390 395 400 

He Tyr Pro Leu Thr Ala Phe Gly Leu Pro Arg Pro Gin Gin Pro Gin 



405 



410 415 



Gin Glu Glu val Thr Ser Pro Val Val Pro Pro Ser Val Lys Thr Pro 
420 425 430 

Thr Pro Glu Pro Ala Glu Val Glu Thr Arg Lys Val Val Leu Met Gin 
435 440 445 

Cys Asn He Glu Ser Val Glu Glu Gly Val Lys His His Leu Thr Leu 
450 455 460 

Leu Leu Lys Leu Glu Asp Lys Leu Asn Arg His Leu Ser Cys Asp Leu 
465 470 475 480 

Met Pro Asn Glu Asn He Pro Glu Leu Ala Ala Glu Leu Val Gin Leu 



wo 98/36054 



PCT/AU98y00085 



-76- 

485 

Gly Phe lie Ser Glu Ala Asp Gin Ser Arg Leu Thr Ser Leu Leu Glu 
500 505 510 

Glu Thr Leu Asn Lys Phe Asn Phe Ala Arg Asn Ser Thr Leu Asn Ser 
515 520 525 

Ala Ala Val Thr Val Ser Ser 
530 535 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCACAGTCGA CCAAGCCGGA GTCGCAGAG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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GCACAAAGCT TGCCAGGAGG GGTCTGGCTG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCra: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
GCACAACCAT GGCCAAGCCG GAGTCGCAGG AG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 14: 



GCACAAGATC TCCAG6AGGG GTCTGGCTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 cimino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Lys Pro Glu Ser Gin Glu Ala Ala Pro Leu Ser Gly Pro Cys 
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10 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Glu ASP Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser Leu Arg Leu Trp Asp 
5 10 15 

Cys 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly Tyr He Lys Glu Asp Glu Ala Leu Pro Ser Pro His Thr Leu Gin Cys 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCACAGGTAC CGAGGCCATG GGCGCGCGC 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCACATCTAG ATCAGTGGTG GTGGTGGTGG TGGACCGGCC CCAGGAGTGG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCACAGCGGC CGCGAGGCCA TGGGCGCGCG C 
(2) INFORMATION FOR SEQ IP NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCACAGCGGC CGCTCAGTGG TGGTGGTGGT GGTGCCAGGA GGGGTCTGGC 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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CA) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CTGACTTCCA TGCCATCCTT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GCTCACGACT CCAATCTGAT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Arg He Val Gly Gly 
5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 959 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) HAME/KEY: CDS 

(B) LOCATION: 2.. 856 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

C GAG CTA TTG TCA GGG CCC TGC GGT CAC AGG ACC ATC CCT TCC CGT 
ASP Leu Leu Ser Gly Pro Cys Gly His Arg Thr He Pro Ser Arg 
15 10 15 

ATA GTG GGT GGC GAT GAT GCT GAG CTT GGC CGC TGG CCG TGG CAA GGG 
He Val Gly Gly Asp Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly 
20 25 30 

AGC CTG CGT GTA TGG GGC AAC CAC TTA TGT GGC GCA ACC TTG CTC AAC 
ser Leu Arg Val Trp Gly Asn His Leu Cys Gly Ala Tbr Leu Leu Asn 
35 40 *5 

CGC CGC TGG GTG CTT ACA GCT GCC CAC TGC TTC CAA AAG GAT AAC GAT 
Arg Arg Trp Val Leu Thr Ala Ala His Cys Phe Gin Lys Asp Asn Asp 
50 55 60 

CCT TTT GAC TGG ACA GTC CAG TTT GGT GAG CTG ACT TCC AGG CCA TCT 
Pro Phe Asp Trp Thr Val Gin Phe Gly Glu Leu Thr Ser Arg Pro Ser 
65 70 75 

CTC TGG AAC CTA CAG GCC TAT TCC AAC CGT TAC CAA ATA GAA GAT ATT 286 
Leu Trp Asn Leu Gin Ala Tyr Ser Asn Arg Tyr Gin He Glu Asp He 
80 85 90 95 



TTC CTG AGC CCC AAG TAC TCG GAG CAG TAT CCC AAT GAC ATA GCC CTG 
Phe Leu ser Pro Lys Tyr Ser Glu Gin Tyr Pro Asn Asp He Ala Leu 
100 105 110 

CTG AAG CTC TCA TCT CCA GTC ACC TAC AAT AAC TTC ATC CAG CCC ATC 
Leu Lys Leu Ser Ser Pro Val Thr Tyr Asn Asn Phe He Gin Pro He 
115 120 125 



46 



94 



142 



190 



238 



334 



382 
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TGC CTC CTG AAC TCC ACG TAC AAG TTT GAG AAC CGA ACT GAC TGC TGG 
CVS Leu Leu Asn Ser Thr Tyr Lys Phe Glu Asn Arg Thr Asp Cys Trp 
130 135 "0 

GTG ACC GGC TGG GGG GCT ATT GGA GAA GAT GAG AGT CTG CCA TCT CCC 
Val Thr Gly Trp Gly Ala He Gly Glu Asp Glu Ser Leu Pro Ser Pro 
145 150 155 



AAC ACT CTC CAG GAA GTG CAG GTA GCT ATT ATC AAC AAC AGC ATG TGT 
Asn Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys 
160 



165 "0 175 



430 



478 



526 



574 



AAC CAT ATG TAC AAA AAG CCA GAC TTC CGC ACG AAC ATC TGG GGA GAC 
Asn His Met Tyr Lys Lys Pro Asp Phe Arg Thr Asn He Trp Gly Asp 
180 185 ISO 

ATG GTT TGC GCT GGC ACT CCT GAA GGT GGC AAG GAT GCC TGC TTT GGT 
Met val cys Ala Gly Thr Pro Glu Gly Gly Lys Asp Ala Cys Phe Gly 
195 200 205 

GAC TCG GGA GGA CCC TTG GCC TGC GAC CAG GAT ACG GTG TGG TAT CAG 
ASP ser Gly Gly Pro Leu Ala Cys Asp Gin Asp Thr Val Trp Tyr Gin 
210 215 220 

GTT GGA GTT GTG AGC TGG GGA ATA GGC TGT GGT CGC CCC AAT CGC CCT 
val Gly val Val Ser Trp Gly He Gly Cys Gly Arg Pro Asn Arg Pro 
225 230 235 

GGA GTC TAT ACC AAC ATC AGT CAT CAC TAC AAC TGG ATC CAG TCA ACC 
Gly val Tyr Thr Asn He Ser His His Tyr Asn Trp He Gin Ser Thr 
240 245 ■ 250 255 

ATG ATC CGC AAT GGG CTG CTC AGG CCT GAC CCA GTC CCC TTG CTA CTG 
Met He Arg Asn Gly Leu Leu Arg Pro Asp Pro Val Pro Leu Leu Leu 
260 265 270 

TTT CTT ACT CTG GCC TCG GCT TCC TCT TTG CTC AGG CCT GCC 
Phe Leu Thr Leu Ala Trp Ala Ser Ser Leu Leu Arg Pro Ala 
275 280 285 

TGAGCCCACA CGTCTACGTC ACACCTGTCA GGTCAGGGTC TCTCTCTTTT GTATCTTGCT 916 



622 



670 



718 



766 



814 



856 



TGCTAATAAA CCTGTTAATA TTTAAAAAAA AAAAAAAAAA AAA 



959 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ASP Leu Leu Ser Gly Pro Cys Oly His Arg Thr He Pro Ser Arg He 



10 



15 



Val Gly Gly Asp Asp Ala Glu Leu Gly Arg Trp Pro Trp Gin Gly Ser 
20 25 30 

Leu Arg Val Trp Gly Asn His Leu Cys Gly Ala Thr Leu Leu Asn Arg 
35 40 *5 



Arg Trp Val Leu Thr Ala Ala His Cys Phe Gin Lys Asp Asn Asp Pro 
50 55 60 

Phe ASP Trp Thr Val Gin Phe Gly Glu Leu Thr Ser Arg Pro Ser Leu 
65 70 75 80 

Trp Asn Leu Gin Ala Tyr Ser Asn Arg Tyr Gin He Glu Asp He Phe 



85 



90 95 



Ser Pro Lys Tyr Ser Glu Gin Tyr Pro Asn Asp He Ala Leu Leu 



Leu 

100 



105 110 



Lys Leu Ser Ser Pro Val Thr Tyr Asn Asn Phe He Gin Pro He Cys 
115 120 125 

Leu Leu Asn Ser Thr Tyr Lys Phe Glu Asn Arg Thr Asp Cys Trp Val 
130 135 1*0 

Thr Gly Trp Gly Ala He Gly Glu Asp Glu Ser Leu Pro Ser Pro Asn 
145 150 155 1«0 

Thr Leu Gin Glu Val Gin Val Ala He He Asn Asn Ser Met Cys Asn 
165 170 175 

His Met Tyr Lys Lys Pro Asp Phe Arg Thr Asn He Trp Gly Asp Met 
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Val Cys 

195 



180 185 190 

Ala Gly Thr Pro Glu Gly Gly Lys Asp Ala Cys Phe Gly Asp 



200 205 



Ser Gly Gly Pro Leu Ala Cys Asp Gin Asp Thr Val Trp Tyr Gin Val 
210 215 220 

Gly val val Ser Trp Gly He Gly Cys Gly Arg Pro Asn Arg Pro Gly 
225 230 235 240 

val Tyr Thr Asn He Ser His His Tyr Asn Trp He Gin Ser Thr Met 
245 250 . 255 

lie Arg Asn Gly Leu Leu Arg Pro Asp Pro Val Pro Leu Leu Leu Phe 
260 265 270 

Leu Thr Leu Ala Trp Ala Ser Ser Leu Leu Arg Pro Ala 
275 280 285 



(2) INFORMATION FOR SEQ ID NO -.27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27; 
AGTGAGTCTC CTGCCTCAGC CTCCCAAGTA GCTGGGACTT CAGGTGTGTG CCACCATCCT 
CAGCTAATTT TTTTTTTTTT TTTTTTTTTG AGAAGGAGTC TTGCTCTGTC GCCCAGGCTG 
GAGTGCAGTG 6CGCQATCTT CCAGGCCCCA CCGGGCCCTC AGGAAGGCCT TGCCTACCTG 
CTTTAAGGGG ACTCCTGGCT CAGGGCCAGG CCCCTGGTGC TGGAGGAGGT GGTGGGTGGA 240 
GGGCAGGGGG CACCAAGCGG GCAGCCAGGA CCCCCGGGCT GCAGACAAGA AAAGGACTGT 



60 
120 
180 



300 
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GGGGTCCACC GGGTCTGGGC CACATCAAGG AATGTGGTTG AAGACCCGCC CTTAGGAGCT 
GAAAGCCAGG GCGCTACCA6 GCCTCAGAGG CCCCAAACAG CCCTTGGGCC TGGTTTGGGA 
GGATTAAGCT GGAGCTCCCA ACCCGCCCTG CCCCCAGGGG GCGACCCCGG GCCCGGCGCG 
AGAGGAGGCA GAGGGGGCGT CAGGCCGCOG GAGAGGAGGC CATGGGCGCG CGCGGGGCGC 
TGCTGCTGGC GCTGCTGCTG GCTCGGGCTG GACTCAGGAA GCCGGGTGAG CTCGGGGCGC 
TGCTGGCGGG ATGGGGAGGC GGGGGAGCGG TGGGGAGGAC GGGAGGTGGA GGCCGCGGGG 
AGTCACTTCT TGTCTCCCGC AGAGTCGCAG GAGGCGGCGC CGTTATCAGG TAGGGCGCCC 
AGGACGCGCG ATTCCTGCCA GGGCCGTTGG GCCGAGGTGG ACGGGGGGCG GTGAGGGGGT 
AGAGGGGGGC CTTTACTGCT CTCTCGCCCC CGCCCCCGGG ATCGAGAACT CTGTTGGCGT 
GGAAAGTAAC TAACGGACGC TGGAGGGGGA TGGGCGGGCC CTGCAGAGCA CGTGGGAGGA 
TCTCCAGTGT CACCTACTTC CTGCTGCACA CACGCGAGGG GACCCTGGGT GGGCAAAAAC 
GTCCTTTCCC GGACQGGGTT GAAGGGGAGA AAGGGAGAGG TCGGGCTTGG GGGGCTGCCT 
CCCGCGGCTC AGCAGTTCCT CTGACCATCC GAGGACCATG CGGCCGACGG GTCATCACGT 
CGCGCATCGT GGGTGGAGAG GACGCCGAAC TCGGGCGTTG GCCGTGGCAG GGGAGCCTGC 
GCCTGTGGGA TTCCCACGTA TGCGGAGTGA GCCTGCTCAG CCACCGCTGG GCACTCACGG 
CGGCGCACTG CTTTGAAACG TGAGTGGGGG TGCGAACGGA GGGGTGCGGG GACGGGCAGG 
AACAGGGCTG GAGGGAGTGC CACCGAACTT TACCTCTGGT CTGATGCCAG ACTTGGGCGT 
GAAAGTTGTC CGTGGATGCG GCCTGGTGTT CTCCT6AGCC CCAGGCTGTG CTGCAGCCGG 
TTACACCCAC TCCAGTTCCC TTTGGGTCTC CTGGAGGGAA CCCTGTTCAG GTTATTCCAG 
AATGTTCTTC CAGAACATTT CCACACACTT TTGGGTATTC TCTCCCTTTT TCTTTCAACC 
CAAAGTTCAC CACTGACCAT CCCACCCTCA TCCCCCCTCC TGGTGGACGG TGCGGTACAG 
TGTGGGGCAC TGAGCCAAGG CCAGCACCCC CGGGCCGCTG TGTGGACTCC ATCCTGCCAA 
TCCCACATTG GCGTGGTGCA TCTCCCCATT CCTCCTTGGG CTGCATGGGG GTGCCCCTGG 
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AGGCCTTCGC TCAA-TOCAAG GCTCCTTGGG ACAGCTCTGG GAGGTGACAA GACCCCACCC 
TTCTGC-roCA G6AGCAGGTC CTAGGACTTT GGTTGTGGTC TGTCTGGGCT CCTTCATTTC 
TGCAGGGGAC CCTGGGTGTT AGCAAGTAGC AGCAACACCA CAGTTTCCCC TCCTGCACTG 
GACCCCAGTT GTGCTCAGGT AGCCAGCCCT CCATCCAGGG CCCCTGACTG CTCTCTTCTC 
TTCTGCCAGC TATAGTGACC TTAGTGATCC CTCCGGGTGG ATGGTCCAGT TTGGCCAGCT 
GACTTCCAIG CCATCCTTCT GGAGCCTCCA GGCCTACTAC ACCCGTTACT TCGTATCGAA 
TATCTATCTG AGCCCTCGCT ACCTGGGGAA TTCACCCTAT GACATTGCCT TGGTGAAGCT 
GTCTGCACCT GTCACCTACA CTAAACACAT CCAGCCCATC TGTCTCCAGG CCTCCACATT 
TGAGTTTGAG AACCGGACAG ACTGCTGGGT GACTGGCTGG GGGTACATCA AAGAG6ATGA 
GGG'TOAGGCT GGGGACAGGC GGGTCAGGGA GGAACTGTCT TTGTTCACCT GTTCCCCTGC 
ATAGGCACAA TAGCCCCCTG CTTGGTCTGG GGGTGCAGGC TATGCCCCTC TTGCTTGCA6 
TCTCTCCTCA CCTGCCAGGG CAGGGACCAA ACACCCAGTT CTCTCCCTTC CAGGGGCTGT 
GGGGGCCAGA AGGAGAGTGT GAGAGGGAGG CCAGTrrGGC GCAAGCCTGT GGGTGGTGCG 
GTGGTGGAGG GGTTCTGGAG GGCTTGGCGA CATAAACCTC ATACTTGGAT TTATTCCTGC 
ATCTTTCCAC CTCCCCCAGT GCTCACCAAT GCCCCAGGCA TCACCAGCTT GCCCCTTCCC 
CCAAGGTCTG GCTTTCGATC CTTATCTGAA CACCGTTTTA AGTTGCCTTG GCCCCTTCCT 
CGGTTCCTTT TTGGCTGAGG AATCTCTCCA TGGCTGCAGG CAGGGCCATT GTTGCCATTC 
TACAGATAGG GAAAGTGCGG CTGGGGGAGC TCT6ACAGCT GTCCCTCCCC GGGGCCTTCT 
GTGATGCTGC TGAGGGCCTC TGTTGTGCTC GGGTCTGGGT TGGAGCTGGG GGTAATGGAG 
ATGAACCTGC CAGGCACAGT GGG1GCCCCA GGGCCCCCAC CCCCGCAGCC TATGCCATCC 
CTCCATAGAG GGGCCTCAGG TTGCTGTCTC TCTCCTTCCC ACTATCGTCC GCACAGCACT 
GCCATCTCCC CACACCCTCC AGGAAGTTCA GGTCGCCATC ATAAACAACT CTATGTQCAA 
CCACCTCTTC CTCAAGTACA GTTTCCGCAA GGACATCTTT GGAGACATGG TTTGTGCTGQ 
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CAATGCCCAA GGCGGGAAGG AT«:CTGCTT CGTGAGTGTC CTTGCCACCA CTCCCAGCCC 
AGGAAAGCAT CCTCTCTCCC TC'TOCCTTAT TTGACCCTCA TGCCAACCCC GGGAGGTGGA 
GACTGTTGCC CCACTCTGCA 6ATCCAGAAA CGGAGGCTTG GCTGCTGCCA GGGGGAGGAG 
GAGGATGTCC ACCCAGTCTA CCCAGCCCCA TAGCCCTTCC CACTCTCAGC CCCTCCCCTG 
CCCCACTCAC TCTOCCCCAG GCTGACCTCA GCCCCGCTGC TCCCCAGGGT GACTCAGGTG 
GACCCTTGGC CTGTAACAAG AAl^GACTGT GGTATCAGAT TGGAGTCGTG AGCTGGG6AG 
TGGGCTG1«G TCGGCCCAAT CGGCCCGGTG TCTACACCAA TATCAGCCAC CACTTTGAGT 
GGATCCAGAA GCTGATGGCC CAGAG^KSGCA OKSTCCCAGCC AGACCCXTTCC TGGCCGCTAC 
TCTTTTTCCC TCTTCTCTGG GCTCTCCCAC TCCTGGGGCC GGTCTGAGCC TACCTGAGCC 
CATGCAGCCT GGGGCCACTG CCAAGTCAGG CCCTOGTTCT CTTCTGTCTT GTTTGGTAAT 
AAACACATTC CAGTTCATGC CT1«CAGGGC A-^CTTCAAA AGCAGTGGCT TCAT6GACAG 
CTCATTCTCT CTTGTGCAGA CAGCCTGTCT GTGCCCCTGG CTCACACCCA CATCTGTTCT 
GCACCATAGA ACCATCT«3T TATTTCGATC AGAAAGAGAA TTGTGTGTT6 CCCAGGCTGG 
TCTTGAACGC CTAGQGTGTC TCGATC 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
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CTCAACCGGG TTXJI^GGCGG CGAGGACAGC ACl^ACAGCG AGTGGCCCTG GATCGTGAGC 
Al^CAGAAGA AT^^GACCCA CCACTGCGCA GGrTCTCl^C TCACCAGCCG CTGGGTGA'K: 
ACTGCTGCCC ACTGTTTCAA GGACAACCTC AACAAACCAT ACCI^TTCTC T^ltJCTGCTG 
OGGGCCTGGC AGCI^GGGAA CCCTCGCTCT CGGTCCCAGA AGGTGGGTGT TGCCTGGGTG 
a;«5CCCCACC CTGOVPATTC CTGGAAGGAA GGTGCCTGTG CAGACArTGC CCTGGTGCGT 

c«:gagcgct ccatacagtt ctcagagcgg gtcctgccca tctgcctacc tgatgcctct 

ATCCACCTCC CTCCAAACAC CCACTGCTGG ATCTCAGGCT GGGGGAGCAT CCAAGATGGA 
GTTCCCTTGC CCCACCCTCA GACCC1«CAG AAGCTGAAGG TTCCTATCAT CGACTCGGAA 
GTCTGCAGCC ATCTGTACTG GCGGGGAGCA GGACAGGGAC CCATCACTGA GGACATGCTG 
TGTCCCX^GCT ACrreGAGGG GOAGCGGGAT GCTTCTCTGG GCGACTCCGG GGGCCCCCTC 
ATOTGCCAGG TGGACGGCGC C^GCTGCTC GCCGGCATCA TCAGC1«GGG CGAGGGCTGT 
GCCGAGCGCA ACAGGCCCGG GGTCTACATC AGCCTCTCTG CGCACCGCTC CTGGGTGGAG 
AAGATCGTGC AAGGGGK3CA GCTCCGCGGG CGCGCTCAGG GGGGTGGGGC CCTCAGGGCA 
CCGAGCCAGG GCTCTGGGGC CGCCGCGCGC TCCTAGGGCG CAGCGGGACG CGGGGCTCGG 
ATCTGAAAGG CGGCCAGATC CACATCTCG?. TCTGGATCTC CGGCGGCCTC GGGCGGTTTC 
CCCCGCCGTA AATAGGCTCA TCTACCTCTA CCT<iTGGGGG CCCGG,.CGGC TGCTGCGGAA 
AGGAAACCCC CTCCCCGACC CGCCCGACGG CCTCAGGCCC CGCCTCCAAG GCATCAGGCC 
CCGCCCAACG GCCTCATGTC CCCGCCCCCA CGACTTCCGG CCCCGCCCCG GGCCCCAGCG 
CTTTTGTGTA TATAAATGTT AAT6ATTTTT ATAGGTATTT GTAACCCTGC CCACATATCT 
TATTTATTCC TCCAATTTCA ATAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1165 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AATGCGGCCA CTCCAAGGAG GCCGGGAGGA TTCTGGGAGG CCAAGACACC CAGGAAGGAC 
GCTGGCCGTG GCAGGTTGGC CTCTCG^TGA CCTCAGTGGG GCATGTATGT GGGGGCTCCC 
TCATCCACCC ACGCTCGGTG CTCACAGCCG CCCACTGCTT CCTGAGGTCT GAGGATCCCG 
GGCTCTACCA TGTTAAAGTC GGAGGGCTGA CACCCTCACT TTCAGAGCCC CACTCGGCCT 
TGGTGGCTGT GAGGAGGCTC CTCGTCCACT CCTCATACCA TGGGACCACC ACCAGCGGGG 
ACATTGCCCT GATGGAGCTG GACTCCCCCT TGCAGGCCTC CCAGTTCAGC CCCATCTGCC 
TCCCAGGACC CCAGACCCCC CTCGCCATTG GGACCGTGTG CTGGGTAAAC GGGCTGGGGG 
TCCACTCAGG AGAGGCCCTG GCGAGTCTCC TTCAGGAGGT GGCTGTGCCC CTCCTGGACT 
CGAACATGTC TGAGCTGATC TACCACCTAG GAGAGCCCAG CCTGGCTGGC CAGCGCCTCA 
TCCAGGACGA CATCCTCTGT GCTGGCTCTG TCCAGGGCAA GAAAGACTCC TGCCAGGGTG 
ACTCCGGGGG GCCGCTOGTC TGCCCCATCA ATGATACGTG GATCCAGGCC GGCATTGTGA 
GCTGGGGATT CGGCTGTGCC CGGCCTTTCC GGCCTGGTGT CTACACCCAG GTGCTAAGCT 
ACACAGACTG GATTCAGAGA ACCCTGGCTG AATCTCACTC AGGCATGTCT GGGGCCCGCC 
CAGGTGCCCC AGGATCCCAC TCAGGCACCT CCAGATCCCA CCCAGTGCTG CTGCTTGAGC 
TCTTCACCGT ATCCTTGCTT GGGTCCCTGT GAACCATGAG CCATGGAGTC CGGGATCCCC 
TTTCTGGTAG GATTGATGGA ATCTAATAAT AAA 
{2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

933 



- WO98/36054 



PCT/AU98/I)0085 



-90- 



(A) LENGTH: 980 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID IK»:30: 
CCTGTGGTCG CCCCAGGATG CTGAACCGAA ■TCGTGGGCGG GCAGGACACG CAGGAGGGCG 
AGTGGCCCTG GCAA6TCAGC ATCCAGCGCA ACGGAAGCCA CTTCTGCGGG GGCAGCCTCA 
TCGCGGAGCA G-KGOTCCTG ACGGC-TCCGC ACTGCTTCCG CAACACCTCT GAGACGTCCC 
TGTACCAGGT CCTGCTCGGG GCAAGGCAGC TAGTGCAGCC GG6ACCACAC GCTATGTATG 
CCCGGGTGAG GCAGGTCGAG AGCAACCCCC TGTACCAGGG CACGGCCTCC AGCGCTGACG 
TGGCCCKSGT GGAGCTOGAG GCACCAG-KSC CCTTCACCAA TTACATCCTC CCCGTGTGCC 
TGCCTGACCC CTCGGTGATC TTTGAGACGG GCATGAACTG CTGGGTCACT GGCTGGGGCA 
GCCCCAGTCA GGAAGACCTC CTCCCCGAAC CGCGGATCCT GCAGAAACTC GCT6TGCCCA 
TCAtCGACAC ACCCAAGTGC AACCTCCTCT ACAGCAAAGA CACCGAGTTT GGCTACCAAC 
CCAAAACCAT CAAGAATCAC ATGCTGTGCG CCGGCTTC6A GGAGGGCAAG AAGGATGCCT 
GCAAGGGCGA CTCGGGCGGC CCCCOGGTGT GCCTCGTGGG TCAGTCGTGG CTGCAGGCGG 
GGG-KATCAG CTGGGG-KSAG GGC-TCTGCCC GCCAGAACCG CCCAGGTGTC TACATCCGTG 
TCACCGCCCA CCACAACIGG ATCCATCGGA TCATCCCCAA ACTGCAGTTC CAGCCAGCGA 
GGTTGGGCGG CCAGAAG-TOA GACCCCCOGG GCCAGGAGCC CCTTGAGCAG AGCTCTGCAC 
CCAGCCTGCC CGCCCACACC ATCCIGCTCG TCCTCCCAGC GCTGCTGTTG CACCTGTGAG 
CCCCACCAGA CTCATTTCTA AATAGCGCTC CTTCCTCCCC TCTCAAATAC CCTTATTTTA 
TTTATGTTTC TCCCAATAAA 
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CLAIMS: 

1. An isolatesd proteinaceous molecule involved in or associated with regulation of cell 
activity and/or viability comprising a sequence of amino acids encoded by a nucleotide 
sequence, at least a portion of which, is capable of being amplified by polymerase chain reaction 
(PGR) using the following primers: 

5' ACAGAATTCTGGGTIGTIAaGCIGaCAYTG3' [SEQ ID N0:1]; and 
S'ACAGAATTCAXIGGICCICCiaGT/AXTCICO' [SEQ ID NO:2]; 
or a complementary form of said primers. 

2. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase comprising an amino acid sequence substantiaUy as set forth in SEQ ID N0:4 
or an amino acid sequence having at least 50% similarity thereto. 

3. An isolated proteinaceous molecule according to daim 1 wherein said molecule is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:6 
or an amino acid sequence having at least 50% similarity thereto. 

4. An isolated proteinase molecule according to claim 1 wherein said molecule is a serine 
proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:8 or an 
amino acid sequence having at least about 50% similarity thereto. 

5. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID NO:3 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID N0:3 under low stringency conditions at 42^. 
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6. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID N0:5 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID NO:5 undar low stringency conditions at ATC. 

7. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID N0:7 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set fortii in 
SEQ ID N0:7 under low stringency conditions at 42°C. 

8. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
kinase comprising an amino add sequence substantially as set forth in SEQ ID NO: 1 0 or having 
50% amino acid similarity thereto. 

9. An isolated proteinaceous molecule according to claim 1 wherein said molecule is a 
kinase comprising an amino acid sequence encoded by a nucleotide sequence substantially as 
set forth in SEQ ID NO:9 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to die nucleotide sequence set fortii in SEQ ID 
N0:9 under low stringency conditions at 42'C. 

10. An isolated nucleic acid molecule encoding a polypeptide wherein at least a portion of 
said nucleic acid molecule is capable of being amplified by polymerase chain reaction (PGR) 
using the following primers: 

5' ACAGAA1TCrGGGTIGTIACIGaGCICAYTG3' [SEQ ID NO:l]; and 
5'ACAGAATrcAXIGGICaCCiaGT/AXrCICC3' [SEQ ID N0:2]; 



or a complementary form of said primers. 
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11. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:4 
or an amino acid sequence having at least 50% similarity thereto. 

12. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:6 



or an 



amino acid sequence having at least 50% similarity thereto. 



13. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID NO:8 



or 



an amino acid sequence having at least about 50% simUarity thereto. 



14. An isolated nucleic acid molecule according to claim 10 comprising a sequence of 
nucleotides substantiafly as set forth in SEQ ID N0:3 or a nucleotide sequence having at least 
50% similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth 
in SEQ ID NO:3 under low stringency conditions at 42°C. 

15. An isolated nucleic acid molecule according to claim 10 comprising a sequence of 
nucleotides substantiaUy as set forth in SEQ ID N0:5 or a nucleotide sequence having at least 
50% amilarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth 
in SEQ ID N0:5 under low stringency conditions at 42'C. 

16. An isolated nucleic acid molecule according to claim 10 comprising a sequence of 
nucleotides substantiafly as set forth in SEQ ID NO:7 or a nucleotide sequence having at least 
50% similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth 
in SEQ ID N0:7 under low stringency conditions at 42*^. 



17. An isolated nucleic acid molecule according to claim 10 wherein said polypeptide is a 
kinase comprising an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 
50% amino acid similarity thereto. 
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18. An isolated nucleic acid molecule according to claim 17 comprising a sequence of 
nucleotides encoded by a nucleotide sequence substantially as set forth in SEQ ID NO:9 or a 
m^leotide sequence having at least 50% similarity thereto or a nucleotide sequence capable of 
hybridising to the nucleotide sequence set forth in SEQ ID N0:9 under low stringency 
conditions at 42X1, 

19. An isolated serine pmteinase encoded by a gene proximal to a cluster of genes of a 
mammalian chromosome. 

20. An isolated serine proteinase according to claim 19 wherein tiie mammalian 
chromosome is human chromosome 16pl3.3 or its equivalent in a non-human species. 

21. An isolated serine proteinase according to claim 20 wherein the gene cluster includes 
at least two genes having the nucleotide sequence as set forth in SEQ ID NO:3 or 5 or 28 or 
29 or 30 or a nucleotide sequence having at least 50% similarity to any one of SEQ ID N0:3 
or 5 or 28 or 29 or 30 or a nucleotide sequence capable of hybridizing to any one of die 
sequences under low stringency conditions at 42"C. 

22. An isolated serine proteinase according to claim 20 wherein said serine protemase is a 
short form of HELA2 having an amino acid sequence substantially as set forth in SEQ ID N0:4 
or an amino acid sequence having at least 50% simUarity thereto. 

23. An isolated serine proteinase according to claim 20 wherein said serine proteinase is a 
tong formof HELA2 having an amino acid sequence substantially as set forth in SEQ ID NO:6 
or an amino acid sequence having at least 50% similarity ttiereto. 

24. An isolated serine proteinase according to claim 22 encoded by a nucleotide sequence 
substantiaUy as set forth in SEQ ID N0:3 or a nucleotide sequence having at least 50% 
simihirity tiiereto or a sequence capable of hybridizing to SEQ I NO:3 under low stringency 
conditions at 42°C. 
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25. An isolated serine proteinase according to claim 23 encoded by a nucleotide sequence 
substantially as set forth in SEQ ID N0:5 or a nucleotide sequence having at least 50% 
similarity thereto or a sequence capable of hybridizing to SEQ ID N0:5 under low stringency 
conditions at 42'X^. 

26. An isolated nucleic acid molecule comprising a nucleotide sequence encoding a serine 
proteinase and corresponding to a gene proximal to a cluster of genes encoding serine 
proteinases. 

27. An isolated nucleic acid molecule according to claim 26 wherein the gene cluster 
includes at least two genes having the nucleotide sequence as set forth in SEQ ID N0:3 or 5 
or 28 or 29 or 30 or a nucleotide sequence having at least 50% similarity to any one of SEQ 
ID NaS or 5 or 28 or 29 or 30 or a nucleotide sequence capable of hybridizing to any one of 
flie sequences under low stringency conditions at 42°C. 

28. An isolated nucleic acid molecule according to claim 25 comprising a nucleotide 
sequence substantially as set forth in SEQ ID N0:3 or SEQ ID NO:5 or a nucleotide sequence 
having at least about 50% similarity to either of SEQ ID N0:3 or SEQ ID NO:5 or a nucleotide 
sequence capable of hybridizing to SEQ ID NO:3 or SEQ ID N0:5 under low stringency 
conditions at AYC 

29. An isolated kinase comprising an amino acid sequence substantially as set forth in SEQ 
ID NO: 10 or an amino acid sequence having at least about 50% similarity thereto. 

30. An isolated kinase according to claim 29 encoded by a nucleotide sequence substantially 
as set forth in SEQ ID N0:9 or a nucleotide sequence having at least 50% sinularity tiiereto or 
capable of hybridizing to SEQ ID NO:9 under low stringency conditions at ATC. 

31. A metiiod of regulating cell activity and/or viability said metiiod comprising contacting 
said cell with an activity and/or viability effective amount of a serine proteinase and/or kinase. 
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32. A method according to claim 3 1 wherein the serine proteinase comprises a sequence of 
amino acids encoded by a nucleotide sequence, at least a portion of which, is capable of being 
ampUfied by polymerase chain reaction (PGR) using the following prinjers: 

5' ACAGAA1TCrGGGTIGTIAaGaGaCAYTG3' [SEQID N0:1]; and 
5'ACAGAATrCAXIGGICaCOaGT/AXTaCX:3' [SEQIDN0:2]; 
or a complementary form of said primers. 

33. A method according to claim 31 wherein the serine proteinase comprises an amino acid 
sequence substantially as set forth in SEQ ID NO:4 or an amino acid sequence having at least 
50% similarity thereto. 

34. A method according to daim 31 wherein the serine proteinase comprises an amino acid 
sequence substantially as set forth in SEQ ID N0:6 or an amino acid sequence having at least 
50% similarity thereto. 

35. A method according to claim 31 wherein the serine proteinase comprises an amino acid 
sequence substantiaUy as set forth in SEQ ID NO:8 or an amino acid sequence having at least 
about 50% similarity thereto. 

36. A method according to claim31 wherein the serine proteinase comprises a sequence of 
amino acids encoded by a nucleotide sequence substantially as set forth m SEQ ID N0:3 or a 
nucleotide sequence capable of hybridising to the sequence set forth in SEQ ID NO:3 under low 
stringency conditions at 42°C. 

37. A iiBthod according to claim 3 1 wherein the serine proteinase comprises a sequence of 
amino adds encoded by a nucleotide sequence substantially as set forth in SEQ ID N0:5 or a 
nudeotide sequence having at least 50% simUarity thereto or a nucleotide sequence capable of 
hybridising to the sequence set forth in SEQ ID N0:5 under low stringency conditions at 42»C. 
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38. A method accordii^ to daim 3 1 wherein the serine proteinase comprises a sequence of 
amino acids encoded by a nucleotide sequence substantially as set forth in SEQ ID N0:7 or a 
nucleotide sequence having at least 50% similarity thereto or a nucleotide sequence capable of 
hybridising to the sequence set forth in SEQ ID N0:7 under low stringency conditions at 42°C. 

39. A method according to claim 31 wherein the kinase comprises an amino acid sequence 
substantiaUy as set forth in SEQ ID NO:10 or having 50% amino acid similarity thereto. 

40. A method according to claim 3 1 wherein the kmase comprises an amino acid sequence 
encoded by a nucleotide sequence substantially as set forth in SEQ ID NO:9 or a nucleotide 
sequence having at least 50% similarity thereto or a nucleotide sequence capable of hybridising 
to the nucleotide sequence set forth in SEQ ID N0:9 under low stringency conditions at 42°C. 

41. A oethodofnKKhilating fertility in a raanmial said inethod comprising moddat^^ 

of HELA2 wherein increasing levels of HELA2 facilitates sperm maturation and development 

42. A method according to claim 41 wherein fertility is enhanced by introducing 
recombinant HELA2. 

43. A method according to claim 41 wherein fertility is reduced by down regulating 
expression of the HELA2 gene. 

44. A composition comprising a serine proteinase and/or kinase capable of regulating cell 
activity and/or viabUity and one or more pharmaceutically acceptable carriers and/or diluents. 

45. A composition according to claim 44 wherein the serine proteinase is HELA2 or a 
functional derivative thereof. 

46. An isolated antibody capable of interacting wiOi a proteinaceous molecule involved in 
or associated with regulation of ceU activity and/or viability comprising a sequence of amino 
acids encoded by a nucleotide sequence, at least a portion of which, is capable of bang 
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amplified by polymerase chain reaction (PGR) using the foUowing primers: 

5' ACAGAATTCrGGGTIGTIAaGaGCICAYTG3' [SEQIDNO:ll;and 
5'ACAGAATrCAXIGGICacaC7GT/AXTaCC3' [SEQIDN0:21; 

or a complementary form of said primras. 

47. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase comprising an amino acid sequence substantially as set fortii in SEQ ID NO:4 
or an amino acid sequence having at least 50% simUarity thereto. 

48. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:6 
or an amino acid sequence having at least 50% siimlarity tiiereto. 

49. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:8 
or an amino acid sequence having at least about 50% sinularity tiiereto. 

50. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantially as set forth in SEQ ID N0:3 or a nucleotide sequence having at least 50% 
similarity tiiereto or a nucleotide sequence capable of hybridising to die sequence set forth in 
SEQ ID N0:3 under low stringency conditions at 42°C. 

51. An isolated antibody according to claun 46 wherein said proteinaceous molecule is a 
serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantiaUy as set forth fa SEQ ID N0:5 or a nucleotide sequence havfag at least 50% 
similarity tiiereto or a nucleotide sequence capable of hybridising to tiie sequence set forth in 
SEQ ID N0:5 under low stringency conditions at 42°C. 
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52. Anisolated antibody according to claim 46 wherein said proteinaceous said molecule 
is a serine proteinase comprising a sequence of amino acids encoded by a nucleotide sequence 
substantiaUy as set forth in SEQ ID NO:7 or a nucleotide sequence having at least 50% 
similarity thereto or a nucleotide sequence capable of hybridising to the sequence set forth in 
SEQ ID N0:7 under low stringency conditions at 42°C. 

53. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
kinase comprising an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having 
50% amino acid similarity thereto. 

54. An isolated antibody according to claim 46 wherein said proteinaceous molecule is a 
kinase comprising an amino acid sequence encoded by a nucleotide sequence substantially as. 
set forth in SEQ ID N0:9 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to the nucleotide sequence set forth in SEQ ID 
NO:9 under low stringency conditions at 42"C. 

55. An antagonist or agonist to the isolated proteinaceous molecule according to any one 
of claims 1 to 9. 

56. A method of determining a predisposition for or the presence of a cancer, said method 
comprising determining the presence of a nucleotide sequence encoding a proteinaceous 
molecule according to any one of claims 1 to 9. 

57. A method according to claim 56 wherein the nucleotide sequence encodes a 
polypeptide wherein at least aportion of said nucleotide sequence is capable of being amplified 
by polymerase chain reaction (PGR) using the following primers: 

5' ACAGAATTCTGGGTIGTIAaGCIGaCAYTG3' [SEQ ID NO: 1]; and 



5'ACAGAATTCAXIGGICCICaaGT/AXraCC3' [SEQID N0:2]; 
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or a complementary fonn of said primers. 

58. A method accoiding to claim 57 wherein said nucleotide sequence encodes a serine 
proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:4 or an 
amino acid sequence having at least 50% sunUarity thereto. 

59. A method accoitling to claim 57 wherein said nucleotide sequence encodes a serine 
proteinase comprising an amino acid sequence substantially as set forth in SEQ ID N0:6 or an 
amino acid sequence having at least 50% sunilarity thereto. 

60. A mediod according to claim 57 wherein said nucleotide sequence encodes a serine 
proteinase comprising an amino acid sequence substantiaUy as set forth in SEQ ID N0:8 or an 
amino acid sequence having at least about 50% similarity thereto. 

61 . A method according to claim 57 wherein said nucleotide sequence is as substantiaUy set 
forth in SEQ ID NO:3 or is a nucleotide sequence having at least 50% similarity thereto or a 
MKdeotide sequence capable of hybridising to die sequence set fordi in SEQ ID N0:3 under low 
stringuicy conditions at 42"C. 

62. A metiwd according to claim 57 wherm said nucleotide sequence is as substantiaUy set 
fortii in SEQ ID N0:5 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to tiie sequence set fortii in SEQ ID N0:5 under low 
stringency conditions at 42°C. 

63. A method according to claim 57 wherein said nucleotide sequence is as substantially set 
fortii in SEQ ID N0:7 or a nucleotide sequence having at least 50% simUarity tiiereto or a 
nucleotide sequence capable of hybridising to tiie sequence set forth in SEQ ID N0:7 under low 
stringency conditions at42°C. 

64. A meflMd according to claim 57 wherein said nucleotide sequence is as substantially set 
forth in SEQ ID NO:10 or having 50% amino acid simUarity tiiereto. 
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65. A method according to claim 57 wherein said nucleotide sequence is as substanUally set 
forth in SEQ ID NO:9 or a nucleotide sequence having at least 50% similarity thereto or a 
nucleotide sequence capable of hybridising to the nucleotide sequence set forth in SEQ ID 
NO:9 under low stringency conditions at 42°C. 
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FIG 2A 
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A. Expression of recombinant Testisin in E. coU. 
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FIG 18 (AI) 
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